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(57) Abstract: The present invention disclosed isolated nucleic acid molecules (polynucleotides) which encode NHL, a putative 
DNA helicase. The present invention in turn relates to recombinant vectors and recombinant hosts which contain a DNA fragment 
encoding NHL, substantially puriSed forms of associated NHL, associated mutant proteins, and methods associated with identifying 
compounds which modulate NHL, which will be useful in the treatment of various neoplastic disorders. Both a genomic clone 
containing regulatory and intron sequences, as well as the exon structure and open reading frame of human NHL are disclosed. 
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TITLE OF THE INVENTION . 

DNA MOLECULES ENCODING HUMAN NHL, A DNA HELIC ASE 

10 - CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims the benefit, under 35 U.S.C. §1 19(e), of U.S. 
provisional application 60/169,970 filed December 9, 1999. 

STATEMENT REGARDING FEDERALLY-SPONSORED R&D 
15 Not Applicable 

REFERENCE TO MICROFICHE APPENDIX 
Not Applicable 

20 

FIELD OF THE INVENTION 

The present invention relates in part to isolated nucleic acid molecules 
(polynucleotides) which encode NHL, a putative DNA helicase. The present 
invention also relates to recombinant vectors and recombinant hosts which contain a 

25 DNA fragment encoding NHL, substantially purified forms of associated NHL, 
associated mutant proteins, and methods associated with identifying compounds 
which modulate NHL, which will be useful in the treatment of various neoplastic 
disorders, given that this gene is located at 20ql3.3 and immediately adjacent to 
M68/DcR3, which is involved in tumor growth. Also included within the present 

30 invention is a human genomic fragment representing this portion of the human 
genome, along with three additional genes (M68/DcR3, SCLIP, and ARP). 
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BACKGROUND OF THE INVENTION 

Naumovski et al. (1985, Mol. Cell Biol 5:17-26; Reynolds et al. (1985 Nucleic 
5 Acid Res 13:2357-2372) and Weber et al. (1990 EMBOJ. 9:1437-1447) disclose 

members of the RAD3/ERCC2 gene family of DNA helicases. 

It is known that several chemotherapeutic agents inhibit helicases, including 

actinomycin CI, daunorubicin and nogalamycih (Tuteja, et al., 1997, Biochem. 

Biophys. Res. Comm. 236(3):636-640), and a prostate cancer drug, CI-958 (Lun, et 
10 al.,1998, Cancer Chemother. Pharmacol. 42(6):447-453). In addition, some 

topoisomerases have been shown to have anti-cancer activity. 

Despite the identification of the aforementioned helicase-encoding genes and 

chemotherapeutic agents, it would be advantageous to identify additional genes which 

reside within chromosomal regions associated with a disease state such as cancer as 
15 well as a gene which encodes a type of protein which may be associated with that 

disease. The present invention addresses and meets this need by disclosing a DNA 

molecule encoding a DNA helicase with a chromosomal location suggestive of 

association with cancer. 

20 SUMMARY OF THE INVENTION 

The present invention relates to an isolated or purified nucleic acid molecule 
(polynucleotide) which encodes a novel mammalian DNA helicase. 

The present invention also relates to an isolated nucleic acid molecule 
(polynucleotide) which encodes mRNA which expresses a novel human DNA 
25 helicase, NHL. 

A preferred aspect of the present invention relates to an isolated or purified 
DNA molecule which encodes human NHL, the nucleotide sequence as set forth in 
Figure I A-B and SEQ ID NO: 1 . 

The present invention also relates to biologically active fragments or mutants 
30 of SEQ ID-NO: 1 which encode a mRNA molecule expressing a novel DNA helicase, 
NHL. Any such biologically active fragment and/or mutant will encode either a 
protein or protein fragment which at least substantially mimics the biological 
properties of the human NHL protein disclosed herein in Figure 2 and as set forth as 
SEQ ID NO:2. Any such polynucleotide includes but is not necessarily limited to 
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nucleotide substitutions, deletions, additions, amino-terminai truncations and earboxy- 
terminal truncations such that these mutations encode mRNA which express a 
functional NHL protein in a host cell, so as to be useful for screening for agonists 
and/or antagonists of NHL activity. 
5 The present invention also relates to recombinant vectors and recombinant 

hosts, both prokaryotic and eukaryotic, which. contain the substantially purified 
nucleic acid molecules disclosed throughout this specification. 

The present invention also relates to a substantially purified form of a human 
NHL protein which comprises the amino acid sequence disclosed in Figure 2 and set 
10 forth as SEQ ID NO:2. 

A preferred aspect of this portion of the present invention is a NHL protein 
which consists of the amino acid sequence disclosed in Figure 2 and set forth as SEQ 
ID NO:2. 

Another preferred aspect of the present invention relates to a substantially 
15 purified NHL protein, preferably a human NHL protein, obtained from a recombinant 
host cell containing a DNA expression vector comprises a nucleotide sequence as set 
forth in SEQ ID NO:l and expresses the respective NHL protein. It is especially 
preferred is that the recombinant host cell be a eukaryotic host cell, such as a 
mammalian cell line. 

20 The present invention also relates to biologically active fragments and/or 

mutants of a NHL protein comprising the amino acid sequence as set forth in SEQ ID 
NO:2, including but not necessarily limited to amino acid substitutions, deletions, 
additions, amino terminal truncations and carboxy-terminal truncations such that these 
mutations provide for proteins or protein fragments of diagnostic, therapeutic or 

25 prophylactic use and would be useful for screening for selective modulators, including 
but not limited to agonists and/or antagonists for human NHL pharmacology. 

A preferred aspect of the present invention is disclosed in Figure 2 and is set 
forth; as SEQ ID NO:2, a respective amino acid sequence which encodes human NHL. 
Characterization of one or more of these DNA helicase-Iike proteins allows for 

30 screening methods to identify novel NHL modulators that may be useful in the 
treatment of human neoplastic disorders. The modulators selected through such 
screening and selection protocols may be used alone or in conjunction with other 
cancer therapies. As noted above, heterologous expression. of a NHL protein will 
allow the pharmacological analysis of compounds which modulate NHL activity and 
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hence may be useful in various cancer therapies. To this end, heterologous cell lines 
expressing a NHL protein can be used to establish functional or binding assays to 
identify novel NHL modulators. 

The present invention also relates to polyclonal and monoclonal antibodies 

5 raised in response to either the NHL or a biologically active fragment of NHL. 

The present invention relates to transgenic mice comprising altered genotypes 
and phenotypes in relation to NHL and its in vivo activity. 

The present invention also relates to NHL fusion constructs, including but not 
limited to fusion constructs which express a portion of the NHL protein linked to 

10 various markers, including but in no way limited to GFP (Green fluorescent protein), 
the MYC epitope, and GST. Any such fusion constructs may be expressed in the cell 
line of interest and used to screen for NHL modulators. 

Therefore, the present invention relates to methods of expressing mammalian 
NHL, and preferably human NHL, biological equivalents disclosed herein, assays 

15 employing these gene products, recombinant host cells which comprise DNA 
constructs which express these proteins, and compounds identified through these 
assays which act as agonists or antagonists of NHL activity. 

The present invention also relates to the isolated genomic sequence which 
comprises SEQ ID NO:i, a 115 kb genomic fragment set forth herein as SEQ ID 

20 NO:3. As especially preferred aspect of this portion of the invention is the region of 
the genomic fragment of SEQ ID NO:3 which comprises the regulatory and coding 
regions of human NHL, as well as intervening sequences (introns). This 1 15 kb 
fragment contains at least the coding region of four genes, NHL, M68/DcR3, SCLIP 
and ARP. As discussed herein, it has been shown that this region of chromosome 20 

25 is associated with tumor growth. Therefore, an aspect of this invention also comprises 
the use of one or more regions of this 115 kb genomic sequence to identify 
compounds which up or downregulate expression of one or more of the genes 
localized within this L 15 kb region, wherein this up or down regulation results in an 
interference of tumor growth. For example, a transcription element of one of these 

30 four genes may be responsible for M68/DcR3 ( and/or NHL) overexpression in 

tumors, and if M68 or NHL overexpression in tumors has a caustic role, blockage of 
M68/DcR3 or NHL overexpression in tumors by interfering with this transcription site 
will be useful. 
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It is an object of the present invention to provide an isolated nucleic acid 
molecule (e.g., SEQ ID NO:l) which encodes novel form of human NHL, or 
fragments, mutants or derivatives of human NHL as set forth in Figure 2 and SEQ ID 
NO:2. Any such polynucleotide includes but is not necessarily limited to nucleotide 
5 substitutions, deletions, additions, amino-terminal truncations and carboxy-terminal 
truncations such that these mutations encode mRNA which express a protein or 
protein fragment of diagnostic, therapeutic or prophylactic use and would be useful 
for screening for selective modulators of human NHL activity. 

It is a further object of the present invention to provide the mammalian, and 
10 especially human, NHL proteins or protein fragments encoded by the nucleic acid 
molecules referred to in the preceding paragraph. 

It is a further object of the present invention to provide recombinant vectors 
and recombinant host cells which comprise a nucleic acid sequence encoding 
mammalian, and especially human, NHL protein and biological equivalent thereof. 
15 It is an object of the present invention to provide a substantially purified form 

of human NHL, as set forth in Figure 2 and SEQ ID NO:2. 

Is another object of the present invention to provide a substantially purified 
recombinant form of a NHL protein which has been obtained from a recombinant host 
cell transformed or transfected with a DNA expression vector which comprises and 
20 appropriately expresses a complete open reading frame as set forth in SEQ ID NO:l, 
resulting in a functional, processed form of NHL. It is especially preferred is that the 
recombinant host cell be a eukaryotic host cell, such as a mammalian cell line. 

It is an object of the present invention to provide for biologically active 
fragments and/or mutants of mammalian, and especially human, NHL, such as set 
25 forth in SEQ ID NO:2, including but not necessarily limited to amino acid 

substitutions, deletions, additions, amino terminal truncations and carboxy-terminal 
truncations such that these mutations provide for proteins or protein fragments of 
diagnostic, therapeutic and/or prophylactic use. 

It is also an object of the present invention to use NHL proteins or biological 
30 equivalent to screen for modulators, preferably selective modulators,. of human NHL 
activity. Any such compound may be useful in screening for and selecting compounds 
active against human neoplastic disorders. 

As used herein, "substantially free from other nucleic acids" means at least 
90%, preferably 95%,. more preferably 99%, and even more preferably 99.9%, free of 
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other nucleic acids. Thus, a human NHL DNA preparation that is substantially free 
from other nucleic acids will contain, as a percent of its total nucleic acid, no more 
than 10%, preferably no more than 5%, more preferably no more than 1%, and even 
more preferably no more than 0.1%, of non-NHL nucleic acids. Whether a given 

5 NHL DNA preparation is substantially free from other nucleic acids can be 

determined by such conventional techniques of assessing nucleic acid purity as, e.g., 
agarose gel electrophoresis combined with appropriate staining methods, e.g., 
ethidium bromide staining, or by sequencing. 

As used herein, "substantially free from other proteins" or "substantially 

10 purified" means at least 90%, preferably 95%, more preferably 99%, and even more 
preferably 99.9%, free of other proteins. Thus, a NHL protein preparation that is 
substantially free from other proteins will contain, as a percent of its total protein, no 
more than 10%, preferably no more than 5%, more preferably no more than 1%, and 
even more preferably no more than 0.1%, of non-NHL proteins. Whether a given 

15 NHL protein preparation is substantially free from other proteins can be determined 
by such conventional techniques of assessing protein purity as, e.g., sodium dodecyl 
sulfate polyacrylamide gel electrophoresis (SDS-PAGE) combined with appropriate 
detection methods, e.g., silver staining or immunoblotting. As used interchangeably 
with the terms "substantially free from other proteins" or "substantially purified", the 

20 terms "isolated NHL protein" or "purified NHL protein" also refer to NHL protein that 
has been isolated from a natural source. Use of the term "isolated" or "purified" 
indicates that NHL protein has been removed from its normal cellular environment. 
Thus, an isolated NHL protein may be in a cell-free solution or placed in a different 
cellular environment from that in which it occurs naturally. The term isolated does 

25 not imply that an isolated NHL protein is the only protein present, but instead means 
that an isolated NHL protein is substantially free of other proteins and non-amino acid 
material (e.g., nucleic acids, lipids, carbohydrates) naturally associated with the NHL 
protein in vivo. Thus, a NHL protein that is recombinantly expressed in a prokaryotic 
or eukaryotic cell and substantially purified from this host cell which does not 

30 naturally {i.e., without intervention) express this protein is of course "isolated NHL 
protein" under any circumstances referred to herein. As noted above, a NHL protein 
preparation that is an isolated or purified NHL protein will be substantially free from 
other proteins will contain, as a percent of its total protein, no more than 10%, 
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preferably no more than 5%, more preferably no more than 1%, and even more 
preferably no more than 0.1%, of non-NHL proteins. 

As used interchangeably herein, "functional equivalent" or "biologically active 
equivalent" means a protein which does not have exactly the same amino acid 

5 sequence as naturally occurring NHL, due to alternative splicing, deletions, mutations, 
substitutions, or additions, but retains substantially the same biological activity as 
NHL. Such functional equivalents will have significant amino acid sequence identity 
with naturally occurring NHL and genes and cDNA encoding such functional 
equivalents can be detected by reduced stringency hybridization with a DNA sequence 

10 encoding naturally occurring NHL. For example, a naturally occurring NHL disclosed 
herein comprises the amino acid sequence shown as SEQ ED NO:2 and is encoded by 
SEQ ID NO:L A nucleic acid encoding a functional equivalent has at least about 
50% identity at the nucleotide level to SEQ ID NO: 1. 

As used herein, "a conservative amino acid substitution" refers to the 

15 replacement of one amino acid residue by another, chemically similar, amino acid 
residue. Examples of such conservative substitutions are: substitution of one 
hydrophobic residue (isoleucine, leucine, valine, or methionine) for another; 
substitution of one polar residue for another polar residue of the same charge (e.g., 
arginine for lysine; glutamic acid for aspartic acid). 

20 As used herein, the term "mammalian" will refer to any mammal, including a 

human being. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 A-B shows the nucleotide sequence which comprises the open reading 
25 frame which encodes human NHL, the nucleotide sequence set forth as SEQ ED NO:l. 
The initiating Met residue (ATG) and the stop codon (TAG) are underlined. 

Figure 2 shows the amino acid sequence of human NHL as set forth in SEQ ID 

NO:2. 

Figure 3 shows the alignment of amino acid sequences of human NHL to 
30 ERCC2/RAD3 gene family members. Rep D (Dictyosteliem discoideum); RAD 3 
(S. cerevisiae)\ RAD 15 (5. pombe) and XP_GroupD {Homo sapien). 

Figure 4 shows Northern analysis of NHL expression in multi-human tissues. 

Figure 5 A-B show the genomic structure of the NHL gene (Figure 5 A) and the 
entire 115 kb genomic region (Figure 5B) containing the NHL, M68/DcR3, SCLff 
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and AJRP genes. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to an isolated or purified nucleic acid molecule 
5 (polynucleotide), which encodes a novel mammalian DNA. helicase. An especially 
preferred aspect of this invention relates to an isolated nucleic acid molecule 
(polynucleotide) which encodes mRNA which expresses a novel human DNA 
helicase, NHL. 

The gene M68/DcR3 is a secreted TNFR member that is overexpressed in a 

10 number of human tumors. M68/DcR3 is located at 20ql3.3, a known site that is 
associated with frequent gene amplification in cancer. M68/DcR3 protein binds, to 
FASL and inhibit FAS mediated apoptosis. Thus, genes tightly linked to M68/DcR3 
may be coregulated (e.g. co overexpressed and/or amplified in tumors). During the 
course of cloning the genomic M68/DcR3 fragment and identifying genes that are 

15 linked to M68/DcR3 at 20q.l3.3, three genes, including a novel gene that is similar to 
the Rad3/ERCC2 helicase family, were identified (termed NHL) in the immediately 
adjacent (overlapping) region. Given NHL's chromosomal location and the frequent 
association of DNA helicases with human genetic disorders (mutations in DNA 
helicases have been found associated with multiple diseases, including xeroderma 

20 pigmentosum, Cockayne's syndrome, Bloom's syndrome, and Werner's syndrome), 
NHL is a candidate for contribution to certain human neoplastic disorders. To this 
end, the genomic clone for this gene is disclosed and the complete sequence is 
determined. The transcript was identified through exon prediction using GRAIL2 and 
sequence alignment to a contiguous 4.5 kilobase region of chromosome 4 (88% 

25 sequence identity). The complete exon structure of NHL was subsequently confirmed 
by RT-PCR analysis. Multiple sequence alignment of NHL to known helicases 
showed that NHL contains all the seven critical helicase domains. BLAST analysis of 
the predicted 1,219 amino acid sequence revealed an approximately 26% sequence 
identity and 48% sequence similarity to the RAD3/ERCC2 gene family of DNA 

30 helicases (Naumovski et al. f 1985 Moi Cell Biol. 5:17-26; Reynolds et al., 1985 
Nucleic Acid Res 13:2357-72; Weber et al., 1990 EMBO J. 9:1437-1447). The 
mRNA expression pattern of NHL was also examined in multiple human tissues. 
Radiation hybrid chromosomal mapping reconfirms that it is linked to M68/DcR3 
locus. 
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A preferred aspect of the present invention relates to an isolated or purified 
DNA molecule which encodes human NHL, the nucleotide sequence as set forth in 
Figure 1A-B and SEQ ID NO:l, which is as follows: 

AGTGAGCCCT GCTGCCAGCC AGTGCCGGGT GCTGGGGACT CAGGGAGGCC CGCCGGGACC 
5 ACTGCGGGAC AGTGAGCCGA GCAGAAGCTG GAACGCAGGA GAGGAAGGAG AGGGGGCGGT 
CAGGGCTCTC AGGAGCCGGG TCCTGGGCAA GGCGCAGCCG TTTTCAAATT TTCAGGAAAG 
CGGTCGGCTC ACACTCGAGC AGTAAAAAGA TGCCTCTGGG. GAGGAGGCCC GTGCAGCTCT 
CCGGGCAATG GTGGTGGCTC GGCCTAGAGA GGCGGTAGTG GAACGCAGAC CCTGGTGGGG 
GAATGACATC AAGGGAGGAG ACGGGCGGGA CCCCAGATTT CTGCCTGTGG GCGATGGAAG 
10 TGAGGTTCAC TGGCCAGCGG AGCCGGACAC AGAACGCGCA AAACGCCGTG TAGGCCTGGA 
GGAGCCGAAG AGCAGGCGGA CCCCCTCCGC GGGGGAACAG TTTCCGCCGG GAGCACAAAG 
CAACGGACCG GAAGTGGGGG GCGGAAGTGC AGTGGGCTCA GCGCCGACTG CGCGCCTCTG 
CCCGCGAAAA CTCTGAGCTG GCTGACAGCT GGGGACGGGT GGCGGCCCTC GACTGGAGTC 
GGTTGAGTTC CTGAGGGACC CCGGTTCTGG AAGGTTCGCC GCGGAGACAA GTGAGCAGTC 
15 TGTGCCATAG GGATTCTCGA AGAGAACAGC GTTGTGTCCC AGTGCACATG CTCGCATCGC 
TTACCAGGAG TGCCCGAGAC CCTAAGATGT TCGGAGTGGT TTTTTCGCAC AGACCCGAAT 
AGCCTGCCCC TCAGCCACGC TCTGTGCCCT TCTGAGAACA GGCTGATATG CCCAAGATAG 
TCCTGAATGG TGTGACCGTA GACTTCCCTT TCCAGCCCTA CAAATGCCAA CAGGAGTACA 
TGACCAAGGT CCTGGAATGT CTGCAGCAGA AGGTGAATGG CATCCTGGAG AGCCCTACGG 
20 GTACAGGGAA GACGCTGTGC CTGCTGTGCA CCACGCTGGC CTGGCGAGAA CACCTCCGAG 
ACGGCATCTC TGCCCGCAAG ATT.GCCGAGA GGGCGCAAGG AGAGCTTTTC CCGGATCGGG 
CCTTGTCATC CTGGGGCAAC GCTGCTGCTG CTGCTGGAGA CCCCATAGCT TGCTACACGG 
• ACATCCCAAA GATTATTTAC GCCTCCAGGA CCCACTCGCA ACTCACACAG GTCATCAACG 
AGCTTCGGAA CACCTCCTAC CGGCCTAAGG TGTGTGTGCT GGGCTCCCGG GAGCAGCTGT 
25 GCATCCATCC TGAGGTGAAG AAACAAGAGA GTAACCATCT ACAGATCCAC TTGTGCCGTA 
AGAAGGTGGC AAGTCGCTCC TGTCATTTCT ACAACAACGT AGAAGAAAAA AGCCTGGAGC 
AGGAGCTGGC CAGCCCCATC CTGGACATTp AGGACTTGGT CAAGAGCGGA AGCAAGCACA 
GGGTGTGCCC TTACTACCTG TCCCGGAACC TGAAGCAGCA AGCCGACATC ATATTCATGC 
CGTACAATTA CTTGTTGGAT GCCAAGAGCC GCAGAGCACA CAACATTGAC CTGAAGGGGA 
30 CAGTCGTGAT CTTTGACGAA GCTCACAACG TGGAGAAGAT GTGTGAAGAA TCGGCATCCT : 
TTGACCTGAC TCCCCATGAC CTGGCTTCAG GACTGGACGT CATAGACCAG GTGCTGGAGG 
AGCAGACCAA GGCAGCGCAG CAGGGTGAGC CCCACCCGGA GTTCAGCGCG GACTCCCCCA . 
GCCCAGGGCT GAACATGGAG CTGGAAGACA TTGCAAAGCT GAAGATGATC CTGCTGCGCC 
TGGAGGGGGC CATCGATGCT GTTGAGCTGC CTGGAGACGA CAGCGGTGTC ACCAAGCCAG 
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GGAGCTACAT CTTTGAGCTG TTTGCTGAAG CCCAGATCAC GTTTCAGACC AAGGGCTGCA 
TCCTGGACTC GCTGGACCAG ATCATCCAGC ACCTGGCAGG ACGTGCTGGA GTGTTCACCA 
ACACGGCCGG ACTGCAGAAG CTGGCGGACA TTATCCAGAT TGTGTTCAGT GTGGACCCCT 
CCGAGGGCAG CCCTGGTTCC CCAGCAGGGC T.GGGGGCCTT ACAGTCCTAT AAGGTGCACA 
5 TCCATCCTGA TGCTGGTCAC CGGAGGACGG CTCAGCGGTC TGATGCCTGG AGCACCACTG 
CAGCCAGAAA GCGAGGGAAG GTGCTGAGCT ACTGGTGCTT CAGTCCCGGC CACAGCATGC 
ACGAGCTGGT CCGCCAGGGC GTCCGCTCCC TCATCCTTAC CAGCGGCACG CTGGCCCCGG 
TGTCCTCCTT TGCTCTGGAG ATGCAGATCC CTTTCCCAGT CTGCCTGGAG AACCCACACA 
TCATCGACAA GCACCAGATC TGGGTGGGGG TCGTCCCCAG AGGCCCCGAT GGAGCCCAGT 
10 TGAGGTCCGC GTTTGACAGA CGGTTTTCCG AGGAGTGCTT ATCCTCCCTG GGG AAGGCTC 
TGGGCAACAT CGCCCGCGTG GTGCCCTATG GGCTCCTGAT CTTCTTCCCT TCCTATCCTG 
TCATGGAGAA GAGCCTGGAG TTCTGGCGGG CCCGCGACTT GGCCAGGAAG ATGGAGGCGC 
TGAAGCCGCT GTTTGTGGAG CCCAGGAGCA AAGGCAGCTT CTCCGAGACC ATCAGTGCTT 
ACTATGCAAG GGTTGCCGCC CCTGGGTCCA CCGGCGCCAC CTTCCTGGCG GTCTGCCGGG 
15 GCAAGGCCAG CGAGGGGCTG GACTTCTCAG ACACGAATGG CCGTGGTGTG ATTGTCACGG 
GCCTCCCGTA CCCCCCACGC ATGGACCCCC GGGTTGTCCT CAAGATGCAG TTCCTGGATG 
" AGATGAAGGG CCAGGGTGGG GCTGGGGGCC AGTTCCTCTC TGGGCAGGAG TGGTACCGGC 
AGCAGGCGTC CAGGGCTGTG AACCAGGCCA TCGGGCGAGT GATCCGGCAC CGCCAGGACT 
ACGGAGCTGT CTTCCTCTGT GACCACAGGT TCGCCTTTGC CGACGCAAGA GCCCAACTGC 
20 CCTCCTGGGT GCGTCCCCAC GTCAGGGTGT ATGACAACTT TGGCCATGTC ATCCGAGACG 
TGGCCCAGTT CTTCCGTGTT GCCGAGCGAA CTATGCCAGC GCCGGCCCCC CGGGCTACAG 
CACCCAGTGT GCGTGGAGAA GATGCTGTCA GCGAGGCGAA GTCGCCTGGC CCCTTCTTCT 
CCACCAGGAA AGCTAAGAGT CTGGACCTGC ATGTCCCCAG CCTGAAGCAG AGGTCCTCAG 
GGTCACCAGC TGCCGGGGAC CCCGAGAGTA GCCTGTGTGT GGAGTATGAG CAGGAGCCAG 
25 TTCCTGCCCG GCAGAGGCCC AGGGGGCTGC TGGCCGCCCT GGAGCACAGC GAACAGCGGG 
CGGGGAGCCC TGGCGAGGAG CAGGCCCACA GCTGCTCCAC CCTGTCCCTC CTGTCTGAGA 
AGAGGCCGGC AGAAGAACCG CGAGGAGGGA GGAAGAAGAT CCGGCTGGTC AGCCACCCGG 
AGGAGCCCGT GGCTGGTGCA CAGACGGACA GGGCCAAGCT CTTCATGGTG GCCGTGAAGC 
AGGAGTTGAG CCAAGCCAAC TTTGCCACCT TCACCCAGGC CCTGCAGGAC TACAAGGGTT 
30 CCGATGACTT CGCCGCCCTG GCCGCCTGTC TCGGCCCCCT CTTTGCTGAG GACCCCAAGA 
AGCACAACCT GCTCCAAGGC TTCTACCAGT TTGTGCGGCC CCACCATAAG CAGCAGTTTG 
AGGAGGTCTG TATCCAGCTG ACAGGACGAG GCTGTGGCTA TCGGCCTGAG CACAGCATTC 
CCCGAAGGCA GCGGGCACAG CCGGTCCTGG ACCCGACTGG AAGAACGGCG CCGGATCCCA 
AGCTGACCGT GTCCACGGCT GCAGCCCAGC. AGCTGGACCC CCAAGAGCAC CTGAACCAGG 
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GCAGGCCCCA CCTGTCGCCC AGGCCACCCC CAACAGGAGA CCCTGGCAGC CAACCACAGT 
GGGGGTCTGG AGTGCCCAGA GCAGGGAAGC AGGGCCAGCA CGCCGTGAGC GCCTACCTGG 
CTGATGCCCG CAGGGCCCTG GGGTCCGCGG GCTGTAGCCA ACTCTTGGCA GCGCTGACAG 
CCTATAAGCA AGACGACGAC CTCGACAAGG TGCTGGCTGT GTTGGCCGCC CTGACCACTG 

5 CAAAGCCAGA GGACTT.CCCC CTGCTGCACA GGTTCAGCAT GTTTGTGCGT CCACACCACA 
AGCAGCGCTT CTCACAGACG TGCACAGACC TGACCGGCCG GCCCTACCCG GGCATGGAGC 
CACCGGGACC CCAGGAGGAG AGGCTTGCCG TGCCTCCTGT GCTTACCCAC AGGGCTCCCC 
AACCAGGCCC CTCACGGTCC GAGAAGACCG GGAAGACCCA GAGCAAGATC TCGTCCTTCC 
TTAGACAGAG GCCAGCAGGG ACTGTGGGGG CGGGCGGTGA GGATGCAGGT CCCAGCCAGT 

10 CCTCAGGACC TCCCCACGGG CCTGCAGCAT CTGAGTGGGG CCTCTAGGAT GTGCCCAGCC 
TGCCACACCG CCTCCAGGAA GCAGAGCGTC ATGCAGGTCT TCTGGCCAGA GCCCCAGTGA 
GTGCCCACGG AGGCCCCCAG CACACCCAAC GTGGCTTGAT CACCTGCCTG TCCAGCTCTG 
GTGGGCCAAG AACCCACCCA ACAGAATAGG CCAGCCCATG CCAGCCGGCT TGGCCCGCTG 
CAGGCCTCAG GCAGGCGGGG CCCATGGTTG GTCCCTGCGG TGGGACCGGA TCTGGGCCTG 

15 CCTCTGAGAA GCCCTGAGCT ACCTTGGGGT CTGGGGTGGG TTTCTGGGAA AGTGCTTCCC 
CAGAACTTCC CTGGCTCCTG GCCTGTGAGT GGTGCCACAG GGGCACCCCA GCTGAGCCCC 
TCACCGGGAA GGAGGAGACC CCCGTGGGCA CGTGTCCACT TTTAATCAGG GGACAGGGCT 
CTCTAATAAA GCTGCTGGCA GTGCCC {SEQ ID NO : 1 } . 

The above-exemplified isolated DNA molecule shown in Figure 1A-B and 
20 SEQ ID NO:l comprise 4946 nucleotides, with an initiating Met at nucleotides 828- 

830 and a "TAG" termination codon at nucleotides 4585-4587. The initiating Met and 

TAG termination codon are underlined. 

The present invention also relates to biologically active fragments or mutants 

of SEQ ID NO:i which encode a mRNA molecule expressing a novel DNA helicase, 
25 NHL. Any such biologically active fragment and/or mutant will encode either a 

protein or protein fragment which at least substantially mimics the biological 

properties of the human NHL protein disclosed herein in Figure 2 and as set forth as 

SEQ ID NO:2. Any such polynucleotide includes but is not necessarily limited to 

nucleotide substitutions, deletions, additions, amino-terminal truncations andcarboxy- 
30 terminal truncations such that these mutations encode mRNA which express a 

functional NHL protein, in a host cell, so as to be useful for screening for agonists 

and/or antagonists of NHL activity. 

The isolated nucleic acid molecules of the present invention may include a 

deoxyribonucleic acid molecule (DNA), such as genomic DNA arid complementary 
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DNA (cDNA), which may be single (coding or noncoding strand) or double stranded, 
as well as synthetic DNA, such as a synthesized, single stranded polynucleotide. The 
isolated nucleic acid molecule of the present invention may also include a ribonucleic 
acid molecule (RNA).' 
5 The present invention also relates to recombinant vectors and recombinant 

hosts, both prokaryotic and eukaryotic, which contain the substantially purified 

nucleic acid molecules disclosed throughout this specification. 

The degeneracy of the genetic code is such that, for all but two amino 

acids, more than a single codon encodes a particular amino acid. This allows for 
10 . the construction of synthetic DNA that encodes the NHL protein where the 

nucleotide sequence of the synthetic DNA differs significantly from the nucleotide 

sequence of SEQ ID NO: 1 but still encodes the same NHL protein as SEQ ED 

NO:2. Such synthetic DNAs are intended to be within the scope of the present 

invention. If it is desired to express such synthetic DNAs in a particular host cell 
15 or organism, the codon usage of such synthetic DNAs can be adjusted to reflect 

the codon usage of that particular host, thus leading to higher levels of expression 

of the NHL protein in the host. In other words, this redundancy in the various 

codons which code for specific amino acids is within the scope of the present 

invention. Therefore, this invention is also directed to those DNA sequences 
20 which encode RNA comprising alternative codons which code for the eventual 

translation of the identical amino acid, as shown below: 

A=Ala=Alanine: codons GCA, GCC, GCG, GCU 

C=Cys=Cysteine: codons UGG, UGU 

D=Asp=Aspartic acid: codons GAC, GAU 
25 E=Glu=Glutamic acid: codons GAA, GAG 

F=Phe=Phenylalanine: codons UUC, UUU 

G=Gly=Glycine: codons GGA, GGC, GGG, GGU 

H=His =Histidine: codons CAC, CAU 

I=Ile =Isoleucine: codons AUA, AUC, AUU 
30 K=Lys=Lysine: codons. AAA, AAG 

L=Leu=Leucine: codons UUA, UUG, CUA, CUC, CUG, CUU 

M=Met=Methionine: codon AUG 

N=Asp=Asparagine: codons AAC, AAU 

P=Pro=Proline: codons CCA; CCC, CCG, CCU 
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Q=GIn=Glutamine: codons CAA, CAG 

R=Arg=Arginine: codons AGA, AGG, CGA, CGC, CGG, CGU 
S=Ser=Senne: codons AGC, AGU, UCA, UCC, UCG, UCU 
T=Thr=Threonine: codons ACA, ACC, ACG, ACU 
5 V=Val=VaIine: codons GUA, GUC, GUG, GUU 
W=Trp=Tryptophan: codon UGG 
Y=Tyr=Tyrosine: codons UAC, UAU 

Therefore, the present invention discloses codon redundancy which may result in 
differing DNA molecules expressing an identical protein. For purposes of this 

10 specification, a sequence bearing one or more replaced codons will be defined as a 
degenerate variation. Also included within the scope of this invention are 
mutations either in the DNA sequence or the translated protein which do not 
substantially alter the ultimate physical properties of the expressed protein. For 
example, substitution of valine for leucine, argmine for lysine, or asparagine for 

15 glutamine may not cause a change in functionality of the polypeptide. 

It is known that DNA sequences coding for a peptide may be altered so as 
to code for a peptide having properties that are different than those of the naturally 
occurring peptide. Methods of altering the DNA sequences include but are not 
limited to site directed mutagenesis. Examples of altered properties include but 

20 are not limited to changes in the affinity of an enzyme for a substrate or a receptor 
for a Iigand. 

The present invention also relates to recombinant vectors and recombinant 
hosts, both prokaryotic and eukaryotic, which contain the substantially purified 
nucleic acid molecules disclosed throughout this specification. The nucleic acid 

25 molecules of the present invention encoding a NHL protein, in whole or in part, can 
be linked with other DNA molecules, i.e, DNA molecules to which the NHL coding 
sequence are not naturally linked, to form "recombinant DNA molecules" which 
encode a respective NHL protein. The novel DNA sequences of the present invention 
can be inserted into vectors which comprise nucleic acids encoding NHL or a. 

30 functional equivalent. These vectors may be comprised of DNA or RN A; for most 
cloning purposes DNA vectors are preferred. Typical vectors include plasmids, 
modified viruses, bacteriophage, cosmids, yeast artificial chromosomes, and other 
forms of episomal or integrated DNA that can encode a NHL protein. It is well within 
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the purview of the skilled artisan to determine an appropriate vector for a particular 
gene transfer or other use. 

Included in the present invention are DNA sequences that hybridize to SEQ ID 
NO:l under stringent conditions. By way of example, and not limitation, a procedure 

5 using conditions of high stringency is as follows: Prehybridization of filters containing 
DNA is carried out for 2 hours to overnight at 65°C in buffer composed of 6X SSC, 
5X Denhardt's solution, and 100 ^g/ml denatured salmon sperm DNA. Filters are 
hybridized for 12 to 48 hrs at 65°C in prehybridization mixture containing 100 jig/ml 
denatured salmon sperm DNA and 5-20 X 10 6 cpm of 32 P-labeled probe. Washing of 

10 filters is done at 37°C for 1 hr in a solution containing 2X SSC, 0.1% SDS. This is 
followed by a wash in 0.1X SSC, 0.1% SDS at 50°C for 45 min. before 
autoradiography. Other procedures using conditions of high stringency would include 
either a hybridization step carried out in 5XSSC, 5X Denhardt's solution, 50% 
formamide at 42°C for 12 to 48 hours or a washing step carried out in 0.2X SSPE, 

15 0.2% SDS at 65°C for 30 to 60 minutes. 

Reagents mentioned in the foregoing procedures for carrying out high 
stringency hybridization are well known in the art. Details of the composition of these, 
reagents can be found in, e.g., Sambrook et al., 1989, Molecular Cloning: A 
Laboratory Manual; Cold Spring Harbor Laboratory, Cold Spring Harbor, New York. 

20 In addition to the foregoing, other conditions of high stringency which may be used 
are well known in the art. 

The present invention also relates to a substantially purified form of a human . 
NHL protein which comprises the amino acid sequence (1219 amino acid residues) 
disclosed in Figure 2 and set forth as SEQ ID NO:2. A preferred aspect of this portion 

25 of the present invention is a NHL protein which consists of the amino acid sequence 
disclosed in Figure 2 and set forth as SEQ ID NO:2, as follows: 



MPKIVLNGVT 


VDFPFQPYKC 


QQEYMTKVLE 


CLQQKVNGIL 


ESPTGTGKTL 


CLLCTTLAWR 


EHLRDGISAR 


KIAERAQGEL 


FPDRALSSWG 


NAAAAAGDPI 


ACYTDIPKII 


YASRTHSQLT 


QVINELRNTS 


YRPKVCVLGS 


REQLCIHPEV 


KKQESNHLQI 


HLCRKKVASR 


SCHFYNNVEE 


KSLEQELASP 


ILDIEDLVKS 


GSKHRVCPYY 


LSRNLKQQAD 


IIFMPYNYLL. 


DAKSRRAHNI 


DLKGTWIFD 


EAHNVEKMCE 


ESASFDLTPH 


DLASGLDVID 


QVLEEQTKAA 


QQGEPHPEFS 


ADSPSPGLNM 


ELEDIAKLKM 


ILLRLEGAID 


AVELPGDDSG 


VTKPGSYIFE 


LFAEAQITFQ 


TKGCILDSLD 


QIIQHLAGRA 


GVFTNTAGLQ 


KLADIIQIVF 


SVDPSEGSPG 


SPAGLGALQS 


YKVHIHPDAG 


HRRTAQRSDA 


WSTTAARKRG 


KVLSYWCFSP 


GHSMHELVRQ 


GVRSLILTSG 
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TLAPVSSFAL 


EMQIPFPVCL 


ENPHIIDKHQ 


IWGWPRGP 




RRF SEECLSS 


LGKALGNIAR 


WPYGLLIFF 


PSYPVMEKSL 


EFWRARDLAR 


r/urn r rrnr n 7 

KMEALKPLFV 


EPRSKGSFSE 


TISAYYARVA 


APGSTGATFL 


AVCRGKASEG 


LDFSDTNGRG 


VIVTGLPYPP 


RMDPRWLKM 


QFLDEMKGQG 


GAGGQFLSGQ 


EWYRQQASRA 


VNQAIGRVIR 


HRQDYGAVFL 


CDHRFAFADA 


RAQLPSWVRP 


HVRVYDNFGH 


VIRDVAQFFR 


VAERTMPAPA 


PRATAPSVRG 


EDAVSEAKSP 


GPFFSTRKAK 


SLDLHVPSLK 


QRSSGSPAAG 


DPESSLCVEY 


EQEPVPARQR 


PRGLLAALEH 


SEQRAGSPGE 


EQAHSCSTLS 


LLSEKRPAEE 


PRGGRKKIRL 


VSHPEEPVAG 


AQTDRAKLFM 


VAVKQELSQA 


NFATFTQALQ 


DYKGSDDFAA 


LAACLGPLFA 


EDPKKHNLLQ 


GFYQFVRPHH 


KQQFEEVCIQ 


LTGRGCGYRP 


EHSIPRRQRA 


QPVLDPTGRT 


APDPKLTVST 


AAAQQLDPQE 


HLNQGRPHLS 


PRPPPTGDPG 


SQPQWGSGVP 


RAGKQGQHAV 


SAYLADARRA 


LGSAGCSQLL 


AALTAYKQDD 


DLDKVLAVLA 


ALTTAKPEDF 


PLLHRFSMFV 


RPHHKQRFSQ 


TCTDLTGRPY 


PGMEPPGPQE 


ERLAVPPVLT 


HRAPQPGPSR 


SEKTGKTQSK 


ISSFLRQRPA 


GTVGAGGEDA 


GPSQSSGPPH 


GPAASEWGL* 


(SEQ ID NO: 


:2) . 







The present invention also relates to biologically active fragments and/or 



15 mutants of the human NHL protein comprising the amino acid sequence as set forth in 
SEQ ID NO:2, including but not necessarily limited to amino acid substitutions, 
deletions, additions, amino terminal truncations and carboxy-terminal truncations such 
that these mutations provide for proteins or protein fragments of diagnostic, 
therapeutic or prophylactic use and would be useful for screening for agonists and/or 

20 antagonists of NHL function. 

Another preferred aspect of the present invention. relates to a substantially 
purified, fully processed NHL protein obtained from a recombinant host cell 
containing aDNA expression vector which comprises a nucleotide sequence as set 
forth in SEQ ID NO: 1 and expresses the human NHL protein. It is especially 

25 preferred is that the recombinant host cell be a eukaryotic host cell, such as a 
mammalian cell line. 

As with many proteins, it is possible to modify many of the amino acids of 
NHL protein and still retain substantially the same biological activity as the wild type 
protein. Thus this invention includes modified NHL polypeptides which have amino 

30 acid deletions, additions, or substitutions but that still retain substantially the same 
biological activity as a respective, corresponding NHL. It is generally accepted that 
single amino acid substitutions do not usually alter the biological activity of a protein 
(see, e.g., Molecular Biology of the Gene, Watson et al, 1987, Fourth Ed., The 
Benjamin/Cummings Publishing Co., Inc., page 226; and Cunningham & Wells, 1989, 
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Science 244:1081-1085); Accordingly, the present invention includes a polypeptide 
where one amino acid substitution has been made in SEQ ED NO:2 wherein the 
polypeptide still retains substantially the same biological. activity as a corresponding 
NHL protein. The present invention also includes polypeptides where two or more 

5 amino acid substitutions have been made in SEQ ED NO:2 wherein the polypeptide 
still retains substantially the same biological activity as a corresponding NHL protein. 
In particular, the present. invention includes embodiments where the above -described 
substitutions are conservative substitutions. 

One skilled in the art would also recognize that polypeptides that are 

10 functional equivalents of NHL and have changes from the NHL amino acid sequence 
that are small deletions or insertions of amino acids could also be produced by 
following the same guidelines, (i.e, minimizing the differences in amino acid 
sequence between NHL and related proteins.. Small deletions or insertions are 
generally in the range of about 1 to 5 amino acids). The effect of such small deletions 

15 or insertions on the biological activity of the modified NHL polypeptide can easily be 
assayed by producing the polypeptide synthetically or by making the required changes 
in DNA encoding NHL and then expressing the DNA recombihantly and assaying the 
protein produced by such recombinant expression. 

The present invention also includes truncated forms of NHL which contain the 

20 region comprising the active site of the enzyme. Such truncated proteins are useful in 
various assays described herein, for crystallization studies, and for structure-activity- 
relationship studies. 

The present invention also relates to isolated nucleic acid molecules which are 
fusion constructions expressing fusion proteins useful in assays to identify compounds 

25 which modulate wild-type NHL activity, as well as generating antibodies against 
NHL. One aspect of this portion of the invention includes, but is not limited to, 
glutathione S-transferase (GST)-NHL fusion constructs. Recombinant GST-NHL 
fusion proteins may be expressed in various expression systems, including Spodoptera 
frugiperda (Sf21) insect cells (Invitrogen) using a baculovirus expression vector 

30 (pAcG2T, Pharmingen). Another aspect involves NHL fusion constructs linked to 
various markers, including but not limited to GFP (Green fluorescent protein), the 
MYC epitope, and GST. Again, any such fusion constructs may be expressed in the 
cell line of interest and used to screen for modulators of one or more of the NHL 
proteins disclosed herein. 
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Any of a variety of procedures may be used to clone NHL. These methods 
• include, but are not limited to, (1) a RACE PCR cloning technique (Frohman, et al., 
1988, Proc. Natl Acad. ScL USA.S5: 8998-9002). 5' and/or 3' RACE may be 
performed to generate a full-length cDNA sequence. This strategy involves using 

5 gene-specific oligonucleotide primers for PCR amplification of NHL cDNA. These 
gene-specific primers are designed through identification of an expressed sequence 
tag (EST) nucleotide sequence which has been identified by searching any number of 
publicly available nucleic acid and protein databases; (2) direct functional expression 
of the NHL cDN A following the construction of a NHL-containing cDNA library in 

10 an appropriate expression vector system; (3) screening a NHL-containing cDNA 
library constructed in a bacteriophage or plasmid shuttle vector with a labeled 
degenerate oligonucleotide probe designed from the amino acid sequence of the NHL 
protein; (4) screening a NHL-containing cDNA library constructed in a bacteriophage 
or plasmid shuttle vector with a partial cDNA encoding the NHL protein. This partial 

15 cDNA is obtained by the specific PCR amplification of NHL DNA fragments through 
the design of degenerate oligonucleotide primers from the amino acid sequence 
known for other kinases which are related to the NHL protein; (5) screening a NHL- 
containing cDNA library constructed in a bacteriophage or plasmid shuttle vector with 
a partial cDNA or oligonucleotide with homology to a mammalian NHL protein. This 

20 strategy may also involve using gene-specific oligonucleotide primers for PCR 
amplification of NHL cDNA identified as an EST as described above; or 
(6) designing 5' and 3' gene specific oligonucleotides using SEQ ID NO: 1 as a 
template so that either the full-length cDNA may be generated by known RACE 
techniques, or a portion of the coding region may be generated by these same known 

25 RACE techniques to generate and isolate a portion of the coding region to use as a 
probe to screen one of numerous types of cDNA and/or genomic libraries in order to 
isolate a full-length version of the nucleotide sequence encoding NHL. 

It is readily apparent to those skilled in the art that other types of libraries, as 
well as libraries constructed from other cell types-or species types, may be useful for 

30 isolating a NHL-encoding DNA or a NHL homologue. Other types of libraries 
include, but are not limited to, cDNA libraries derived from other cells. 

It is readily apparent to those skilled in the art that suitable cDNA libraries 
may be prepared from cells or cell lines which have NHL activity. The selection of 
cells or cell lines for use in preparing a cDNA library to isolate a cDNA encoding 
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NHL may be done by first measuring cell-associated NHL activity using any known 
assay available for such a purpose. 

Preparation of cDNA libraries can be performed by standard techniques well 
known in the art. Well known cDNA library construction techniques can be found for 

5 example, in Sambrook et aL 1989, Molecular Cloning: A Laboratory Manual] Cold 
Spring Harbor Laboratory, Cold Spring Harbor, New York. Complementary DNA 
libraries may also be obtained from numerous commercial sources, including but not 
limited to Clontech Laboratories, Inc. and Stratagene. 

It is also readily apparent to those skilled in the art that DNA encoding NHL 

10 may also be isolated from a suitable genomic DNA library. Construction of genomic 
DNA libraries can be performed by standard techniques well known in the art. Well 
known genomic DNA library construction techniques can be found in Sambrook, et 
al., supra. One may prepare genomic libraries, especially in PI artificial chromosome 
vectors, from which genomic clones containing the NHL gene can be isolated, using 

15 probes based upon the NHL nucleotide sequences disclosed herein. Methods of 
preparing such libraries are known in the art (Ioannou et al., 1994, Nature Genet. 
6:84-89). 

In order to clone a NHL gene by one of the preferred methods, the amino acid 
sequence or DNA sequence of a NHL or a homologous protein may be necessary. To 

20 accomplish this, a respective NHL protein may be purified and the partial amino acid 
sequence determined by automated sequenators. It is not necessary to detennine the 
entire amino acid sequence, but the linear sequence of two regions of 6 to 8 amino 
acids can be determined for the PCR amplification of a partial NHL DNA fragment. 
Once suitable amino acid sequences have been identified, the DNA sequences capable 

25 of encoding them are synthesized. Because the genetic code is degenerate, more than 
one codon may be used to encode a particular amino acid, and therefore, the amino 
acid sequence can be encoded by any of a set of similar DNA oligonucleotides. Only 
one member of the set will be identical to the NHL sequence but others in the set will 
be capable of hybridizing to NHL DNA even in the presence of DNA 

30 oligonucleotides with mismatches. The mismatched DNA oligonucleotides may still 
sufficiently hybridize to the NHL DNA to permit identification and isolation of NHL 
encoding DNA. Alternatively, the nucleotide sequence of a region of an expressed 
sequence may be identified by searching one or more available genomic databases. 
Gene-specific primers may be used to perform PCR amplification of a cDNA of . 
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interest from either a cDNA library or a population of cDNAs. As noted above, the 
appropriate nucleotide sequence for use in a PCR-based method may be obtained 
from SEQ ID NO:l either for the purpose of isolating overlapping. 5" and 3' RACE 
products for generation of a full-length sequence coding for NHL, or to isolate a 

5 portion of the nucleotide sequence coding for NHL for use as a probe to screen one or 
more cDNA- or genomic-based libraries to isolate a full-length sequence encoding 
NHL or NHL-like proteins. 

This invention also includes vectors containing a NHL gene, host cells 
containing the vectors/and methods of making substantially pure NHL protein 

10 comprising the steps of introducing the NHL gene into a host cell, and cultivating the 
host cell under appropriate conditions such that NHL is produced. The NHL so 
produced may be harvested from the host cells in conventional ways. Therefore, the 
present invention also relates to methods of expressing the NHL protein and 
biological equivalents disclosed herein, assays employing these gene products, 

15 recombinant host cells which comprise DNA constructs which express these proteins, 
and compounds identified through these assays which act as agonists or antagonists of 
NHL activity. 

The cloned NHL cDNA obtained through the methods described above may 
be recombinantly expressed by molecular cloning into an expression vector (such as 

20 pcDNA3.neo, pcDNA3.1, pCR2. 1 , pBlueBacHis2 or pLITMUS28) containing a 
suitable promoter and other appropriate transcription regulatory elements, and 
transferred into prokaryotic or eukaryotic host cells to produce recombinant NHL. 
Expression vectors are defined herein as DNA sequences that are required for the 
transcription of cloned DNA and the translation of their mRNAs in an appropriate 

25 host. Such vectors can be used to express eukaryotic DNA in a variety of hosts such 
as bacteria, blue green algae, plant cells, insect cells and. animal cells. Specifically 
designed vectors allow the shuttling of DNA between hosts such as bacteria-yeast or 
bacteria-animal cells. An appropriately constructed expression vector should contain: 
an origin of replication for autonomous replication in host cells, selectable markers, a 

30 limited number of useful restriction enzyme sites, a potential for high copy number, 
and active promoters. A promoter is defined as a DNA sequence that directs RNA 
polymerase to bind to DNA and initiate RNA synthesis.. A strong promoter is one 
which causes mRNAs to be initiated at high frequency. To determine the NHL cDNA 
sequence(s) that yields optimal levels of NHL, cDNA molecules including but not 
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limited to the following can be constructed: a cDNA fragment containing the full- 
length open reading frame for NHL as well as various constructs containing portions 
of the cDNA encoding only specific domains of the protein or rearranged domains of 
the protein. All constructs can be designed to contain none, all or portions of the 5' 
5 and/or 3' untranslated region of a NHL cDNA. The expression levels and activity of 
NHL can be determined following the introduction; both singly and in combination, 
of these constructs into appropriate host cells. Following determination of the NHL 
cDNA cassette yielding optimal expression in transient assays, this NHL cDNA 
construct is transferred to a variety of expression vectors (including recombinant 
10 viruses), including but not limited- to those for mammalian cells, plant cells, insect 
cells, oocytes, bacteria, and yeast cells. Techniques for such manipulations can be 
found described in Sambrook, et al., supra, are well known and available to the 
artisan of ordinary skill in the art. Therefore, another aspect of the present invention 
includes host cells that have been engineered to contain and/or express DNA 
15 sequences encoding the NHL protein. An expression vector containing DNA 

encoding a NHL-like protein may be used for expression of NHL in a recombinant 
host cell. Such recombinant host cells can be cultured under suitable conditions to 
produce NHL or a biologically equivalent form. Expression vectors may include, but 
are not limited to, cloning vectors, modified cloning vectors, specifically designed 
20 plasmids or viruses. Commercially available mammalian expression vectors which 
may be suitable for recombinant NHL expression, include but are not limited to, 
pcDNA3.neo (Invitrogen), pcDNA3.1 (Invitrogen), pCI-neo (Promega), pLITMUS28, 
pLITMUS29, pUTMUS38 and pLITMUS39 (New England Bioloabs), pcDNAI, 
pcDNAIamp (Invitrogen), pcDNA3 (Invitrogen), pMClneo (Stratagene), pXTl 
25 (Stratagene), pSG5 (Stratagene), EBO-pSV2-neo (ATCC 37593) pBPV-l(8-2) 

(ATCC 37110), pdBPV-MMTneo(342-12) (ATCC 37224), pRSVgpt (ATCC 37199), 
pRSVneo (ATCC 37198), pSV2-dhfr (ATCC 37146), pUCTag (ATCC 37460), and 
1ZD35 (ATCC 37565). Also, a variety of bacterial expression vectors may be used to 
express recombinant NHL in bacterial cells. Commercially available bacterial 
30 expression vectors which may be suitable for recombinant NHL expression include, 
but are not limited to pCR2.1 (Invitrogen), pETlla (Novagen), lambda gtH 
(Invitrogen), and pKK223-3 (Phannacia). In addition, a variety of fungal cell 
expression vectors may be used to express recombinant NHL in fungal cells. 
Commercialiy available fungal cell expression vectors which may be suitable for 
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recombinant NHL expression include but are not limited to pYES2 (Invitrogen) and 
Pichia expression vector (Invitrogen). Also, a variety of insect cell expression vectors 
may be used to express recombinant protein in insect cells. Commercially available 
insect cell expression vectors which may be suitable for recombinant expression of 

5 NHL include but are not limited to pBlueBacffl and pBlueBacHis2 (Invitrogen), and 
pAcG2T (Pharmingen). 

Recombinant host cells may be prokaryotic or eukaryotic, including but not 
limited to, bacteria such as E. coli, fungal cells such as yeast, mammalian cells 
including, but not limited to, cell lines of bovine, porcine, monkey and rodent origin; 

10 and insect cells including but not limited to Drosophila and silkworm derived cell 
lines. For instance, one insect expression system utilizes Spodoptera frugiperda - 
(Sf21) insect cells (Invitrogen) in tandem with a baculovirus expression vector 
(pAcG2T, Pharmingen). Also, mammalian species which may be suitable and which 
are commercially available, include but are not limited to, L cells L-M(TK~) (ATCC 

15 CCL 1 .3), L cells L-M (ATCC CCL 1 .2), Saos-2 (ATCC HTB-85), 293 (ATCC CRL 
1573), Raji (ATCC CCL 86), CV-1 (ATCC CCL 70), COS-1 (ATCC CRL 1650), 
COS-7 (ATCC CRL 1651), CHO-K1 (ATCC CCL 61), 3T3 (ATCC CCL 92), 
NIH/3T3 (ATCC CRL 1658), HeLa (ATCC CCL 2), C127I (ATCC CRL 1616),.BS- 
C-l (ATCC CCL 26), MRC-5 (ATCC CCL 171) and CPAE (ATCC CCL 209). 

20 As disclosed in Example section 1, a 1 15 kb BAC clone (from Genome 

Systems) was subcloned and subjected to restriction and sequence analysis. Four 
genes at chromosome location 20ql3.3 were identified, including M68/DcR3, NHL, 
SCLIP and ARP (Figure 5A). The nucleotide sequence of this BAC clone, hbm!68, 
is presented as follows: 

25 TGAAGAGCTT TGACCAAGAG GCTGTGACGA GGCCCTACGA GGACTCTGGC TCTCCTCCTG 60 
CTAAGCACAC CCAGGCAGGT GTCCTGGCAG ATGAGGACCA CATGCAGAGC CTCGGCCAGC 120 
CCACCAATGC CCGGATATGC AAGTGAGCCC AGCCTGGACC CCCCGGCGAG GCCCAGCAGC 180 
ACCAGCCCAG GCCCGAAAAC CTTAAGAAAT GACCAGTGTC TGCTGCTTTA AGCCACCAAG 240 
CTCTGCGGTG GTTTGTTAGG CTGCAAGCAT GGCTAATTCA GAAACTGCCA GAAACAAGCA 300 

30 CTGCTGTCCC CAGCCTGGGA CACACAGCAC CGCCTCTGCG TGGGGAGAGG GCACAGGCTA 360. 
AGGGCACAAA TGCCATCCCA GACCCGGCTC TTGTGTGTGG AAGGGGCCAC TGTGCCATGA 420 
GGCAGAGGAA ACCTTGGCAG GACCTTATGC CACAGCAATT TAAAAGAGAA GAAACAGGCT 480 
GGGCGTGGTG GCTCATGCCT ATAATCCCAG" CACTTTGGGA GGCCAAGGTG GTGGATCACT ' 540 
TGAGGTCAGG AGTTCAAGAC CAGCCTGGCC AATATGGTGA AACCCTGTCT CTACGAAAAA '600- 
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TACAAAATTT AGGCAGGCGT GGTGGCGGGT 
" GGCAAGAGAT TTACTTGAAC CCAGGAGGTG 
ACTGCACTCC AGCCTGTGTG ACGGAGTGAG 
ATCTGACTAG TGTGATCTCG . CAAGGAACAT 

5- TCCAAACAAG GAATCCCCAG GGGATCAAAT 
AGCCTGTAAC CCCAGTGCTT CGGGAGGCTG 
CAAGACTAGC TTGGGCAACA CAGTGAGAGC 
CAGTCCCAGC ACTGTACTAA AAATCTACAC 
AGTCCCAGCT ACTCAGGAGG CTGAGGCAGG 

10 GCCATGAGCC GTGACTGTGC CACTGCACTC 
CGAAAAACAA AAAATCATTT CATGTCTCCA. 
GTAAAAAATG GTTCTGACCC ACAGAAATCT 
TTTAAAAAAG AGATGGTCTC AGAATAAAGA 
TCACACCTTA GAAAGCTTTC TGGCCGCATC 

15 ATCAGTGAGG TGCCTTCGAA CAAGCCACCT 
CCGCACGCTC AGTAGGGGAG GGGAAGTCGC 
GACGTGCCAT AACCAAGAAG CCCCAGCCAC 
GCTGGGAAGG CCCAGGAAAA GGCGGGAGGC 
AGAAATGATC TCAACGTTTT GTAACTTCCT 

20 ACCAGCACTT GGTGATGTCG CTTGCGTCAA 
CTTTCCTCTG CAACTGTGGG CTTACGGGGC 
ACAGACCGGG CAAAGCAGAG GAGAGCCACC 
TCCCTCCAGA GACCCCTGTG GCCAAGGCCA 
AGCTGACGAC AGGGGTTGGT GAGGCCGGCC 

25 GAATTAAAAA TAATAACAAC TATGTCTTGT 
TCTTTCTTTT TCTTTTTTTT TTTCGGAGAC 
GCAGTGGCGC AATCTTGGCT CACTGCAAGC 
CCTCAGCCTC CTGAGTAACT GGGACTACAG 
CAAAAGAAAA CTGACACAAT CATAAACAGA 

30 CTCTACTTCT CACTGTTCTT CAAAGAGTTA 
TAACATCCTA GAAAAAAAGC TCCTACTCAG 
CCACGTTCCA CAGTGACCAC AGGAAACCAA 
ACTGCGCCAG CCGTGCGCCT GTGTCTGCCA 
CAAGACACCC ACAGAGGAGA GCTCTAAGCC 
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CCC TGTAATC 


CCTGCTATTC 


AGGAGGCTGA 


660 


cacztzctcctc 

UAUVJL X A \J 


CAGTGAGCTG 




720 




AAAAAAAAAA 




/ o u 


Trr , arar ap a 

l LLALAt—rt\-A 


p.tp. c a p.r* t a c 


AAP.PTTPTTP 


O *i \J 


l\j J, 1 1 1 1 


pp.pppap.ap a 




3UU 


ALL i LLLALL 


AL I LL i lLAkj 


1 LL A^jtjAL 1 1 


Q ^ Pi 

you 


CC ATT. ALLLA 


ppp , p ,p pr , p r npp 
LLL L 1 LL 1 LL 


LALA1LLL 1L 


i m Pi 


GGGGL L LLLL 


A 1 LL 1 LLL AL 


A1LLL IL lAL 


1 PiQ Pi 


AC GAT ILL 1 1 


LAALLLALLA 


LL 1 L ALLLL 1 


1 1 A Pi 


L ALT, L 1 L 1 LL 


AAL nu AAL. \jt\ 


P UPTPTPTTT 




GTT 1L 1LLAL 


1 LLL AAAALA 


CTC^CVC A A P 
L 1 L i H L AAL 


1 £ Pi 


AAGAAALLAA 


A A A A rp A T* A A A 
AAAA1 A I AAA 


A A A T A P A A A A 
ABA I ALAAAA 


1 10 Pi 


CCAALL 1 LLL 


P T A T* P P r P r P P T* 

L 1A ILL 1 1L 1 


LAL 1L 1 1LLL 


1 1 Q Pi 


TGGLLAAALL 


LLLALLL 1 LL 


PPP A TPTTPP 

LLLA1L1 ILL 


1 /I A Pi 


GLLLTLLALL 


PPPTPPTPfP 
LLL ILL 1 L 1 L 


TTPTPTPPPA 
I 1L IL 1LLLA 


i ^ n pi 


mn np mrT"T ,r P A P 1 

TAGLTTTTAL 


ttp apparTf 
1 1LALLAL1L 


i L 1 LLA 1 L AA 


1 ^ C Pi 


ACCCAGALLL 


LAIL1LLLLA 


L AALLLL i LA 


icon 


GGACGAATGG 


AAA TV'T'P A l "P r P 

AAATLT. L AT 1 


L ILTLLLLAL 


icon 
lb oU 


AC C AAG ALLL 


AL 1 L 1 I ALL 1 


L 1 LLLL 1 1LA 


1 "7 j1 Pi 


TCAAGGCAAC 


ALAAL 1 LALL 


ALLALLLLLA 


1 OrtA 

loUU 


A n A f~* A A T 1 P* Z'"* 

AAALAAL ill 


ALLLL ILLA^j 


PTPPapraTP 
L 1 LLALL A 1 L 


1 o 0 U 


C AGL L L ALL L 


I ALL 1 L 1 LLL 


TP AP APTPIPP 

1 LALAL 1 LLL 


1 Q 0 Pi 
1 j ^* U 


L L L AL AL L At? 


P A'PflTPPTTP. 






L AL AL L rvjV_ A 


P.A APPAPPAA 


PPAAPPAAPA 




pTTiar,rrAr 

111 rtftUv k—A\«. 


TAAGTTTTGG 

X jtjt.<j x, x x x 


■T\ X VJVJ X X X Vrf X X 


2100 


GCAGTCTCAC 


TCTGTTGCCC 


AGGCTGGAGT 


2160 


TCTGCCCCCC 


GGATTCACGC 


CATTCCCCTG 


2220" 


GTGCCTGCCA 


TTGGGTGTTT 


TCTTAAACAG 


2230 


GCAAGCAAGA. 


GAACTTGGCA 


ATTATTTCCT 


2340 


ACTCAAGCAT 


AAGATGTGAG 


CAAATTCTTT 


2400 


TGTTCATAAA 


GCAAAGCTAA 


CCTACAGGAG 


2460 


GACAGCAAGT 


GGGACACCAG 


CCTCCAGGGC 


2520 


CTGCCCTGGT 


CCGTCACTGC 


CACCAGCCGG 


2580 


ACAACTGTGT 


ACGAAGACAA 


CTGTGCAGGA 


2640 
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TTTTATTACT ACAACATTTT TGTTTTCTTT 
TGTCACCCAG GCTGGAGTGC AGTGGCACAA 
GTTCAAGCAA TTCTCCTGCT GCAGCCTCCC 
TGGCTAATTT TTGTACTTTT AGTAGAGATG 
5 AAATTCCTGA CAAGTGATCC ACCCACCCTG 
' AGCCACTGCG CCTGGCCCAT TTTTGTTTAT 
CACATTTCAA ATGGGTAACT CCAGTGTCCT 
CTCTTGGGTG TGAGAGTCTA GCATGTGGGT 
AGAGCCGGGC AGGGTAAAGA GACCCCCTAG 

10 TGGGCAACAC CCACTCCATC AGATGGACCC 
AGGCAAGAGA CCGAGGGCAC GAGCTGTTCC 
TTATACTAGA TATAGATCAT AAATATGATT 
AGCAATTACT CTTTATTCCA ATATTATAAT 
CTAGGAAATC ATAACCTAGG AAAAACTAGG 

15 GTGAGAACTG ACCAGAAGAC AAGAGTGCGA 
GAGGGCTCCT TGGTCTAGCG GTAACGCCAG 
CCGTGGTCTA GCGGTAGCCT CAGTGTCAAG 
AAGAGAGTCT CCCTTTCCCC GGGGGAGTTT 
GAGGGCCTGA CATCAGTCAG GCCCGCCCGC 

20 GATGCTGTGC TTCAGTGGTC ACGCTCCTAG 
GCTCTGCCTT CTAGATAGCA GCAGCAAATT 
GAAATAATGG CGTAAGCGGT CTCTCTCTCT 
AAGGGAAGGG CCGCCTGGCC AGTGGGCACG 
ATGGTTCACA CTCCTTACCC TGCCGCTTTG 

25 ATTCGGGGCC GCTACCAGTC TCCGCGTCTT 
TTTTTCTTTT ATCTTTGTCT TGTGTCTTTA 
GAGAAAACCC ACCAACCCTG TGGGGCTGGT 
TAGGCACGAC TCAGCTGCTG TAGTGAATTG 
GGACATCTGG CCCAGTCTCC TCGTTGAGCC 

30 CATGCTAATG GGCAATATGA GGACTAAATG 
TAAAAGGCAC TTTGCGGGGA AAGAAACTTT 
CACCCTGTCA CCCAGGCTGC AGTGCAGTGG 
CTCTCAGGCT CAAGCAATCC TCCCGCCTCA 
TGCCACCACA CCCAAATAAT TTTTATATTT 
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T TT T T ^1* T TT 


TTTGAGACTG 


AGTCTCGCTC 


2700 


TCTCGGCTCA 


CTGTAACCTC 


CATCTCCCTG 


2760 


AACTGGATTA 


CAGGCGCCCG 


CCACCACGCC 


2820 


GGGTTTCACC 


ATGTTGGCCA 


GACTGGTCTC 


2880 


GCCTCCCAAA 


GTGCTGGGAT 


TACAGGTGTG 


2940 


faATAAAAAT' 


GTAPTTAATG 


TTGAACTPTP 


3000 


TGATGPTPPT 


GPGAPATGTT 


CGTGAGAPTT 


3060 


p.p,tptpp,ap a 


G G AG AH 


GGAAG AGTGP 




GATGTGAAGG 


PCGCCCTGCA 


TTTGTCAGAP 


'3180 


TGGTATGGGP 


i GGCAAGPCAC 


CTAGGTGCPG 


3240 


\j\3 lui An X Art 


AATGPATAAA 


ATAAGAATAG 


3300 


a T A T 1 A Tfl A A T 


ATPATTPATP 


ATTAGTTTGT 
n, x x rt,vj i i lui 


3360 




TAAGPATAAP 


PTAGGAA AAA 


3420 


pp a tap a gag 

V_l_rti rtV»-rtV3rtVj 


AT AGG AG C TG 


AGGGGAPATA 


3 480 


GP PTTPTGTT 


ATGPPTGGAP 


AGGGP P AP P A 


3 540 




GAPGPPPGTT 


GPPAAGTGGA 


3 600 


pa a a a ap app 


PGPTAPTTAG 


PAAAPPAGGA 


JOOU 


apapaan apt 


PTAPTPPTPP 


APPTPTTGPG 


^790 

J ' few 


aPTfaTPPPP 


AGGPPTAAPP 


GTPTPPPTGT 

Ui^iLLL XoX 


3780 


IV-V-vj^V*. 1 X\-rt 


TGTTPP ATPP 


TGTGPAPPTG 


3 840 


aPTP a a aGTA 


PTGAAAGTPT 


PTGATAAGPA 


3 900 


PTPTPPTPTP 


TPTPTGPPTP 


AGPTGPPAGG 


3960 


TGAPPPAPAT 


GACCTTACCT 


ATCACTGGAC 


4020 


TCTTGTATCC 


AATAAATAGC 


GCAACCTGGC 


4080 


GGTGGTAGTG. 


GTCCCCCAGG 


CCCAGCTGTC 


4140 


TTTCTACACT 


CTCTCATCTC 


CGCATACGAG 


4200 


CCCTACACCC 


TGGCTTTGTA 


GACTGGAGCC 


4260 


CGATCCTCCA 


AACCCAGCAA 


GGCACCTGCA 


4320 


AGTTCACGAA 


AAAGAGACTT 


TTCTGAGTGA 


4380 


GGATGGTCTC 


CAACTTGGAC 


AAACCAACAG 


4440 


TCCTTTTTTC 


TTTTTTTTGA 


GACAGGATCT 


4500 


CATGACCTTG 


GCTCACTGCA 


GCCTCAACCT 


4560 


ACCTCCCATG 


CAGCTGGGAC 


CATAGGTGCA 


4620 


TTTGTAGAGA 


CGAGGTTTCA 


CTATGTTGCT 


4680 
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CGGGCTGGTC TCAACTCCTG GGCTCAAGCA 
AGATTACAGG CAGGAGCCAC CAGGCCTGGC 
AATATTAGAA GAGATGAAAA TTCATCAACA 
AACCAGAATG GAAGCTGTGT GTACAGTGGG 

5 CAGGGCTCCA CGGTGTGGCG ACGCCCCATG 
ATGAGGACCT GGGGTGCTTT TCAACATGTA 
AAGAAAAATT CATATGTTTC TATATAAACA 
TGACAGTAGG AATCTCTAGG TAGTAGTAAT 
TTTTCTAATT TTCTTTTTCT TTCTAAATAA 

10 ATATTTTGTA ATAAAAATGG TTATATTTAA 
AAATGGCAAA TTAGACACAC ACATGTGGGC 
GCACTTTGGG AGGCTGAGGC AGGCAGATCA 
CCAACATGGT GAAACCCCGT CTCTACTAAA 
ACACCTATAG TGCCAGCTAC TTGGGAAGCT 

15 CAGAGGTTGT AGTGAGCCGA GATCATGCCA 
ACTCCAACTC AAAAAAAAAT AAAAATAACA 
TCACAACAAT GCAGAGGGAA GAGCTTCCAA 
GTGCGCGCAG ACGCAGAGAG CAGCAAGGTG 
AGATGACTGA TGCTCAGTTT ATACCCCAAA 

20 TACCTCTGCC AGCAGGAATG AGGGAAAGGA 
ATCCAGCTAC CTGCCATCCT CAGCCCCACT 
CCCTGCCTAC AGGACCCTGA GAGCTAGGGG 
AAATTCATAT GTTGAAGGCC TCATCCCCAG 
GAGATAATTT AAATGAGGTC ATATAAGTTG 

25 GTGGTTTTTT TTTTTTTGGA GACTGGGTCT 
CACGATCACG GCTCACTGCT GTCTCCAACT 
CTCCTGAGTA GCTGGGACTA CAGGTGCTTA- 
TTTTTGTAGA GATGGGGTTT TGCCATGTCG- 
ATCTGTCTCC CTCGGCCTCC TGCAGTGCTG 

30 GGACCGTGAC CTTCTAAGAA GTGAAAGAGA 
ATGAGGACAC AGCAAGAAGT CGGCCATCTG 
GAACCTGGCA GACCCTGATC TTGGACTTCA 
GCTGTTCAGG CCACGCGGTC TACGGCAGCC. 
AGTCAGAGCA GATCAGGAAG AAAGGCCTAG 
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ApppTPPPAC 


CTCAGCCTCC 


CAAAGTGCTC 


4740 


PAAPATAGGA 


AGAAATTTAA 


ATTTGAATTG 


4800 


tppaaapapa 


AAGATCATTA 


AC T AAAG CPA 


4860 


GTPTPATGPT 


GGGAACGC GA 




4920 


b 1LLL 1.1 lul 




PPAfZPPPA AP 
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Lb I bAb i 1 In 
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bbl 1 lAAbbA 




b AAL A 1 L 1 (jo 


AAAb A 1L iftl 


TPT A APPTPT 
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^1 fin 
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L 1 1 A 1 L Ab 1 A 


^1 £n 
D 1 0 u 


1 1L1 Abb 1 A 1 


GAAATAATTT 


TCT AC CAT AT 


D £ i* U 


J. 1 1 I 1 lAAAb 


GCTGTACAAA 


CTTCCTGATA 


JtOU 


Cobb 1 ALAb i. 


GGCTCGCGCC 


TGTAATTCCA 


D J *4 U 


LL 1 AAbb 1 LA 


GGAGTTTGAG 


ACCAGCCTGG 




TATALAAAAA 


TGAGCTGGAT 


GTGGTGGCAC 




GAbbLAbbAA 


AATTGCT.TCA 


ACCCGGGAGG 




LTbL AL I Lb A 


GCCTAGGCAA 


CAAGAGCGAG 


n 


pip A rTTT A A 

CALALbTbAA 


TAGGCTCCTC 


ATGGAAGTCA 




AbTb rAAAbb 


CAGAAGCGAG' GAGCAGGAGG 


3 / U vJ 


C AbAL 1 bAbA 


GGCGGAGGCT 


GGCCGTGGGG 


^7 fin 


TL Lb I AAA 1 L 


TAGAGGCCTG 


GCACATCAAC 




bb bL AAL L AA 


AAGATGTCCC 


ACCCTCACCC 


JOOU 


bb L AbAAb Ab 


CCTGAGAGGT 


GGAGGCAGGC 




AAbbbbi lAl 


CCTGAACTGT 


GTCCCCCGTA 




rfpTf A PTPT A 
1 b 1 bAb 1 b 1 a 


TTTAAAGATG 


GGGTCTTCAG 


6060 


bbbk* ivnlLL 


AGTAAGACTT- 


TGACCTTCTG 


6120 


PAP TCT ATC A 


CTCAGGTTGG 


AGTACAGTGG 


6180 


CCTGGGCTCA 


GGTGATCCTC 


CTGCTTCAGC 


6240 


CCACCGCACC 


CAGCTGGTGG 


TGCATTGTGT 


6300 


CCCAGGCTGG 


TCCTGAACTG 


GGCTCAAGTG 


6360 


GAATTACAGG 


TATGAGCCAC 


CGCGCCTGGC 


6420 


AAGATCTTTC 


TCTCTCCCTC 


CCTCTCCATC 


6480 


CAAGGTAGAA 


AGCGAGTCCT 


CCCAACAGCT 


6540 


GCCTTCAGAG 


CTGTAAGAAA 


ATAACTCTCT 


6600 


CGAGCAGACT 


AAGACACACG 


CCATCTGGGG 


6660 


AGCTCAGGAT 


ACTGAAGGTC 


CCAACCCGGT 


6720 
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GCTGGACLAb 


a pp a p rrr p p 

Ab b Ab L b b b b 
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CGGCTCGGGT 
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CAGGGTACCT 


CACACCAGGA 


ACCCACACAG 


GTbbATbl b I 


T'PP'T^PTPTP 7i 
1 bb i b 1 b 1 b A 


TP AP APaap A 
1 b Ab AV, AAb A 






GCCTCCACCA 


CCCTGACATG 


CAGGAGGGAG 


pmp) AAA pP fp 

b I b AAAbb b 1 


PPPP.TPP A AP 
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A AP APPPTPP 
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ACAGCAAGGG 


AAGAAAGGCA 


GG AAGG AAb T 


p a ppppp taPP 
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TPPTPPP APP 
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P APP APPTPP 
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/ O £ u 
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TCGTCCTCTG 
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AGAAAGTCAA 
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IbAblbAI Al 
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7 7 d n 




CATGAAGATG 


CCACACATGA 
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GGATb AbAbb 


A A PPPP A. ApA 

AAbbbbAAbA 


ptppppp apa 
b lbbbbbAbA 


7 fl Pi 0 




CCAGAGCCCC 


AGAGCCCTCT 


GGGGAGGAAG 


A A T> A A PP A TV 1 

AAT AAbb A 1 b 


bb Abbb I bbb 


Ab i bbbbbbb 


7 asn 
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GCTGACTCTG 


CCTTTATTTC 


ACCCCAGCAG 


A ppp A PPAPT 

AGGbAbbAb 1 


PTiPaPPPPPT 
bAbAbbbbb 1 


P AP APP APP A 
bAb Ab b Abb A 






GCAGCTCTGC 


CACCTCCTAG 


CAGTTCCACb 


r PA/TT*/"'P7iPP 
1 AbbbbbAbb 


a a a ap a a app 

AAAAbAAAvjv, 


TPPP A PTTTP 
1 Vjvjv- t\\j 111b 


'JOU 




GGCAAATGTT 


AGCGTTTTTG 


CCAACTAAbA 


Timmpi a A TPPP 
I I lbAAlV,vjVj 


AP ATPTPPTA 


P AP.AP.ATPAP 

V^rtVjrtVJrt 1 VjrtVJ 


8 040 




. GAAGAAAACA 


CTCACAGTTT 


CATGAAbAb 1 


PTPa HP A A A A 
vj i V-AA\j«-rtrlrt 


TP APTPAPTP 


TTP APTTP AT 


3100 




TTATGAAAGG 


/-» r> t\ P/imprnP(Ti 

CCAGCTLTbT 


bAbA 1 bbb I A 


PP APTPPPTP 

V-V- AV. 1 V—v_ V^. L V^ 


X V— iiV-.iV A VJ-T^VJfT 


AATCACGGCr 


8160 


25 


mm/Tin ~ts /~* f~* is. 

TTTCAGGACG 


rrTT* a (TT" 1 a p p 
TGb AGb bAb b 


IbbbbAlbv—rt. 


P,P,TAPfiGnAC5 

VjVJ i. rtVrf OVJO/TCVJ 


GCCTCCCCGC 

VJV-» V_ 1\.V<V«V«\JV< 


AGCTGCAGCT 


8220 




GGGTCTTCTG 


GTCCCCGTGC 


CATTTCTGCT 


TTTCTTCGCT 


CTCTACTTAC 


ACACACATTT 


8280 




GAGTCCAGTC 


TCAGAAGAAC 


TGGAACTAGA 


AAAATCCTGA 


CACTTGTCCC 


TTACTACGTT 


8340 




AATGCCAGCT 


GTGCCAAGGA 


CAGCCCAACC 


CAAGCCCCCA 


TCAGCCCCAA 


■TGGCACCGAG 


8400 




GCCCGAGCTT 


ACCCGTGAGG 


GGCCAAGTTG 


GTCGTCACCA 


ACACGGTCTT 


CACCCCGTCC 


8460 


30 


ACACCACTGC 


CGTCCACTGC 


AGTGTCCGGA 


GTTGTCACAA 


CCACCACCTC 


CTCCATGTGC 


8520 




ACACTCACGT 


CGGGAGTCGC 


CATGGCTCAG 


CGGAAGGGGA 


CGCCCAGGCC 


AGCAGCGTCA 


8580 




GTCCTCCAGG 


GTCCCAAGTC 


CTGGAGGAAG 


CAAGGCAGGG 


CACAGGGATG 


GAGTCATCTC 


8640 




CACATCCACA 


CAACATAGCA 


CTCACAAAGG 


CATCTCTAAT 


CAGCTCCAAA 


GACCCACCCT 


8700 




TGAGTCCCAG 


ACTGCTACCT 


CCTGACAAAA 


ACGAGCGGCA ACAGAAGGGC 


TACTCCAGGC 


8760 
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TCTGGTTCCG AGGGCGGTGT 


AAGCGCACTC 


CACCCGTTTT 


lbbbAb ibbA 


n"*A A f" , f" , <T' , A a A 

1 AAbL b b AAA 


p a *5 n 
ooz u 




CCCTTGGGTA GAAAGCACAG 


AGCCACTCCC 


TCCACGTGGG 


bb IbAbAbbA 


Abb Ab Abb 


0 0 0 u 




AGGGGCCTGG AATTCCAAGC 


AACTTCCCTG 


GACGCAGGCT 


bbbbbbl 1Gb 


bAb 1 I b 1 tbb 


on/in 




GTCTCTCCTG GCCTGAACTC 


AAAGCCAGCC 


CCAATCCCTG 


A A r*rT*f* A prpmm 

AACTGAGTTT 


bAbGTGCAGA 


yuuu 


5 


AAGCACTCCA AGAAGTCCTC GCTGGTCTGT GGAACGGGAA 


GGGAAACCCA 


TTCAAGACAG 


yoou 




AAAGAGAGGA GGGAAACGCC 


CTGGGTTTTT 


TTbbb 1 i 1 I I 


GGGTTTTTTT 


TGAGACGGAG 


9120 




TCTCGCTCTG TCGCCCAGGC 


TGGAATGCAG 


i Gbb AbbAbb 


TCGGCTCACT 


GCAAGCTCCA 


9180 




CCTCCTGGGT TCAAGTGATT 


CTCCTGCCTC 


AGCLTLT. bbA 


ATTGCTGGGA 


TTACAGGTTT 


9240 




* CACCATGTTG CCCAGGGTGG 


TCTCAAACTC 


CTbALb LbAb 


GTGATCCACT 


CACCTCGGCC 


93 00 


10 


TCCCAAAGTG CTGGGATTGC 


AGGTGTGAGG 


CAbbATbbb I 


GGCCTGCCCC 


GGGTTTAAAA. 


93 60 




ATTATTATTA TTTTGTCTTT 


CCTGGCTTTG 


CCTTbAbbAA 


GTCCAACCCC 


TGCTAAAACC 


9420 




CGGTGATAAT GGCTGTCCTG 


GCCCAAAAAG 


s^mmfif A /"* A f~* A 

CTTGGAbAbA 


GGGGAATCTT 


CCTCCTGACT 


9480 




AAAGGAATGG TGGCCCAAGA 


GTGTGGGGGC 


TCCCTGTTbb 


CCTCTCACTC 


TCCATCCCCT 


9540 




ACCTAGCACA GGGAACACAA 


AAGCCCCTGG 


TTTCCAbbbA 


GAGGGCAACG 


AGCCTGGAGT 


9600 


15 


CAGAGTGTGG GGGAGGCGAC 


AAGAGGAGAG 


GGGAGAAGAG 


AGGATGGCAC 


ACAGCTGTGT 


9660 




. GTGAGCGCCT GGGTCGTCCC 


AAGACAGTCT 


CTACGTGGTC 


CTGACCCTAA 


AGGGCAAAGG 


9720 




GAAGAAAACT GACCTACAGG ATAGGCCACT 


GCCCAGGTCT 


CAGATGGGCC 


CCAGTGGCGC 


9780 




ATATGGGACA GATCCACAGT 


GCACTGGAAA 


GTCTCTAAAA 


TAAACTGGCC 


TAAGAACACA 


9840 




GACACAGGAA CGGGGTGCAA 


AATTTGCAGC 


CTGAACCTAA 


CCAGGTCGAT 


TTCTTGCTAT 


9900 


20 


GAAAAAAAAA AGTCTACATT 


CTCTGTGAAA 


CTTAAAACAA 


GACCTAGAGT 


CCATAGCACA 


9960 




GTAGTCAAAG CATCCAGAAC 


ACGATCAAAC 


TTCCTGGCAA 


AGGGTAGTCT 


GGTTGATTCT 


10020 




CAAAGGAACA AATACACAAG 


AGAAGCTGGC 


TCTTGAACGC 


AGAATCCAGA 


GACTTTCAGG 


1 f\ rt 0 A 

10080 




TGCTATCGGA CCAGCTCCAA 


GAGGAAAGCA 


AACATTGTCA 


ACCAAGTGGA 


7v 71 t» j\ 71 71 morn 

AAGAAAA TCT 


1U14U 




TGGTATAGAA ACAGGAGTTA 


TAACCAAACA 


GAAATGTGAA 


AATTAAAAAC 


P7\P7\ APPTi 7\ A 

GAbAAbbAAA 


xuzuu 


25 


AGAAAATACA CAAAGCTGGG 


ATAGTCTCAG 


CTACTCGGAA 


GGCGGGGCTG 


bAbb A i. bb I I 


1 no c n 




TGAGCCTAGG AGATTGAGGC 


TGCAATGAGC 


TGTGATCACA 


CCACCGCACT 


CCAGTCTGGG 


10320 




CAACAGAGTG AGAACTCTCT 


CAAAAAACGA 


AAAAGAAAGA 


AAGTAGAACA 


GAAGTGACCA 


10380 




GGGGCTGGGG GAGGGAGTAC AGGGAGTTGT TCTTTAATGA GTACAGAATT 


TCTGTTTGGG 


10440 




ATGATGAAAA GCTCTGGAAA 


TGGACGGCGG 


TGATGGCTGC 


ACAATCACTG 


TGGCTGTTCT 


10500 


30 


GAATGGTGCT GAACCACACA 


TTTAAAAACA 


GTTAAAATGG 


GCTGGGCGTG 


GTGGCTCACG 


10560 




CCTGTAATCC CAGCACTTTG 


GGAGGCGGAT 


CGCCTGAGGT 


■ CAGGAGTTCG 


AGACCATCCT 


10620 




GGCCAACACA GTGAAATCCT 


GTCTTGACTA' AAAATACTAA 


AAATTAGCCA 


GGCATGGTGG 


10680 




CAGGCACCTG TAGTCCCAGC 


TACTTGGGAG 


GCTGGGGCAG 


GAGACCTGCT 


TGAACCCAGG 


10740 




AGGCAGAGGT TGCAGTGAGC 


CGAGATCGTG 


CCACTGCACT 


■ CCAGCCTGGG 


CAACAAGAGC 


10800 
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GAAACTCCAT CTCAAAAAAA AAAAAAAAAA 
TTATGTTATG TATATTTTAC CGTAATAAAA 
CATTAGGATT AGATATAGAC TAGAAAGTAC 
ATTTTCATGG CCAGGCATGG TGTCACACCT 
5 AGAGGAATGC TTGAGCTCAG GGGTTTGAGA 
TCTGCTAAAT AAATAATAAA AAATAGCCAG 
CTACTCTGGA GGCTGAGGCA GGAGGATCAC 
CCATGGTTGT GCCACTGCGC TCCAGCCTGG 
AGAAAGAAAG AAAAGAAAAA AGAAAGAAAA 
10 TTGTTAATAT AAAAATAAGA TCACCTGCCC 
GACAGGAGTG" GGTCCACCCC AACCTGGCAC 
GCACTCACTA AGAGGAGGCA CTCATAGCCC 
CCAGAACTAA CTCACCCAAA CTGAACAATC 
TTCTTGCTAA ATCTTACTCA ACCGACATTT 
15 TAGGGAGAGG AAGTCCGCAA GAGAACAAAA 
CCATTCCACA. GGCTGTCTTT TGTCTGGGTA 
TACCTGGGCA TGGGTCTCCA CACAACTCCA 
AGAAAAGAAA GCTACGATTT CCTGGGCCAC 
' AACCCTTGAC CAGAGATCCA AGGAACCAAG 
20 TGGTTAAACA CCATGAAGGA AAGACCCATT 
GGGGAGCTGG GTACCACCCG CAGCAAAGAC 
TGGGAGAGCA GTGAGTGTGG CGGACGGTGA 
TAAGAAGGGG CAGCGCCCAG AGCCCAGAGA 
TCAAAGTATT CACGGAACAC ATCTAAACAC 
25 AAAATATTCC CCGTTTCCTG • TTTCAGGAGG 
TATTGTTTAA ATGAGGAAAA AAAATTTTTA 
CCGTTTACTG TCACTCAATT TAAATGGTGG 
GGGGCTCACA GGTGAAGCCC TCCAGCTGCT 
TCTCTGAAGA CTGTCTTCAG ATGCTTTTTA 
30 ACACTGGCGG AATTACTTCG TATTTCTCAA 
GTTGTAGACT TAGGCTTCAA AATAAAGAAC 
AGGCCGGGCG CGGTGGCTCA CACTTGTAAT 
ATCATAAGGT CAGAAGTTCA AAGACCAGCC 
• AAAAATACAA AAATTAGCCG GGCGCGGTGG 
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AAAAAAAAAA 


7i A/ r, T"T l *T>7i 7A 7a & 

AAb III AAAA 


Ibol 1 AAA 1 1 


lUo bU 


ACACTGTAAT 


GL T AL T A T AA 


I AbAAIbAL 1 




AGAATATAAA 


AAC TTTTT AA 


ACAAAGAAAA 


i noon 


GTAATCCCAG 


GACTTTGGGA 


GGCCAAGGCA 


11040 


CCAGGCTGGG 


CAACACAGCA 


ACACCCCATC 


11100 


GCATGGTGGT 


GTGCACGCCT 


GTAGTTGCAG 


11160 


TTAAGCCCAG 


GAGGTCAAGG 


CTGCAGTGAG 


11220 


GCAACAGATC 


AAGACCTTGT 


CACAAAAAAA. 


11280 


TAAAATCTTC 


CAGAACTTTT 


AAAATCATCA 


11340 


CTAGGACTGT 


AACAAACAAG 


TGTGTCTAAG 


11400 


GCAGTGGTCC 


CCTGCGGAGA 


GTCTGGCCCT 


11460 


AGCCAGGCCT 


CTGCAATTAT 


GCCTTCAATG 


11520 


GATCACAAAA 


TGTGCCTTCA 


GGTCTCAAGG 


11580 


TCCAGCATGG 


GAACATTTTT 


CTGAATGTCT 


11640 


GGTCCTCAGG 


CCACCCTAGC 


TTCTTTTCCT 


11700 


TGCACTGGAC 


CAGGGGGCTC 


TACTTCTTCC 


11760 


AGGTAAAGGG 


CCACAGGCAA 


GATAAAGGGG 


11820 


CAATCGCAAA 


TGGCAGCCAG 


TCTCTGAAGT 


11880 


AAATGTAGGT 


GATCTGAACA 


GAGGGGATGG 


11940 


CTCAAAGAAA 


AGGAAGCAAA 


AAGAAACCGT 


12000 


CCCGCACGCG 


TTACTGACGC 


CAGCCTGGCC 


12060 


GTGGCGGGGA 


GGGCTGTGGT 


AGGTTTAGGG 


12120 


ACACCAGTGA 


GGGCTCCACA 


GGAACACTAC 


12180 


AAGCACTAAG 


GACTAAGTGC 


GAGGGACAAG 


1 O O A ft 


GTATCGAAAA 


TGAGTGATGG 


AAGGAAAATG 


1 Tjnrt 
Ll J UU 


CAAATTAAGA 


ACATCCTGGA 


ACATGATGAG 


12360 


CCATCTAGGA 


CAGAGCGCCT 


AAGGGGAAAG 


12420 


GGTGGGCAAT 


TTCCCATTAG 


GGCATCAGGG 


12480 


GCCAGGAAAG 


TTACAATGAT 


GAATTCGTTT 


12540 


ATATAATGTT 


TTCACTAGCA 


TAACTTTGTT 


12600 


TTTAAACAAA 


CATGAATAAA 


AAGCCACTTT 


12660 


CCCAGCACTT 


- TGGGAGGCCG 


CGGCGGGTGG 


12720 


TGATCAATAC 


GGTGAAACCC 


CGTCTCTACT 


'12730 


CAGGTGCCTG 


TAATCTCAGC 


TACTTGGGAG 


12840 
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GCTGAGGCAG GAGAATCGCT TGAACCTGGG 
CCACTGCACT CAAGCCTGGG TGACAGAGTG 
TAAAATTTTA CTCAGGCCAG GTGTGGTGGC 
CCGAGGCGAG CAGATCACCT GAGGTCAGGA 
5 CTTGTCTCTA CTGAAAACAC AAAAATTAGC 
GCTACTCAGG AGGCTGAAGT .GAGAGAACTG 
GCCAAGACTG CACCACTACA CTTCAGCCTG 
AAAAAAAATT CAAAAATTTG GCCAGGCGTG 
GAAGGCCGAG GCGGGTGGAT CACCTGAGGT 
10 ATGAAACCCT GTCTCTACTA AAAATACAAA 
GCGGGTGCCT GTAATCCCAC CTACTTGGGA 
GAGGCAGAGG TTGCAGCGAG CCAAGATTGT 
GAGACTCTGT CTCAAAAAAA AAAAAATTAA 
CTACTGATCT CCCGTGCTGA CTTCTCGGGG 
15 ATAAGGGTAA GCTCAGCAGG GGCAACTAAA 
ACAGCCTCAC TGCAGGCAGG GGTCCTCGTC 
AATGTACACG CACAAAACTG AATTCCAGCC 
TTCTTTTTTT GAGACGGAGT C.TCGCTCTGT 
CGGCTCAATG CAAGCTCTGC CTCCCGGGTT 
20 TAGCTGGGAT TACAGGCGCC CACCACCACG 
CGGGGTTTCA CCATGTTGGC CAGGACAGTC 
TCTGCCTCCC AAAGTGCTGG GATTACAGGC 
ACTATATTAA AGGTTCAAAG CAATATGCAC 
CCTTGATACA ATGCTCTTTA GACACAGAAG 
25 TTGCCTTCCC AAGGGCGAGC ACCCCCTCCT 
GATGGAGTCT CGCTCTGTCA CCCAGGCTGG 
ACCTCCGCCT CCCGGGTTGA AGTGATTCTC 
CAGGCACCCA ACACGCCTAG CTAATTTTTG 
TTGGCCAGGC TGGTCTCGAA CTCCTGACCT 
30 GTGCTGGGAT TATAGGTGTG AGCCACTACA 
AAGCCAAGAC' CCACCCAGCT TTGATCCCAA 
TTTCCAAATT ATTAGGTAAA TCAGATATGA 
GAGACGCAAT CTTGCTCCGT CACCCAGGCT 
CAACCTCTGC CTTCTGGGTT CAAGCAATTC 
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CAGCAGAGGT 


TvjCAG TvjACjL 


L AAG ATC ATG 


12900 


AGACTCTCTC 


TTAAAAAAAA 


AAAGCCACTT 


12960 


TCACGCCCAT 


AATCCTAGCA 


CTTTGGGAGG 


13020 


GTTAGACCAG 


CCTGGCCAAC 


ATGGTAAAAC 


13080 


TGGGCGTGGT 


GGTGTGCCCA 


TGTAATCCCA 


13140 


CTTGAACCCG 


GGAGGCAGAG 


GCTGCAGTGT 


13200 


GGCGACAGAG 


CAAGACCCTG 


TCTCAGAAAA 


13260 


GTGGCTCACG 


CCTGTAATCC 


CATCACTTTG 


13320- 


CAGGAATTCA 


AGACCAGCCT 


GGCCACCATG 


13380 


AAAAAAAAAA 


CAAATTGGCC 


GGGCATGGTG 


13440 


GGCTGAGGCA 


GGAGAATCTC 


TCGAACTCCG 


13500 


GCCACTGCAC 


TCCAGCCTAG 


ACAACAGAGC 


13560 


AATTAAAAAA 


TAAAAATTTC 


ATTTAAAATA 


13620 


TTTAACTCTC 


ACTGAGGAGA 


CGCTGCTTTC 


13680 


GTCATTTAAG 


CAGAGAGCTG 


CAAAGAGGCA 


13740" 


ACAGCTTCAG 


GGCTTTGCAG 


AGGATTACGC 


13800 


TCTCCATTGG 


CAACTGCATA 


CATACATATA 


13860 


AGCCCAGGTT 


GGACTGCAGT 


GGCCCGATCT 


13920 


CAAGCGATTC 


TCTTGCCTCA 


GCCTCCTGAG 


13980 


CCCGGCTAAT 


TTTTGTATTT 


TTAGTAGAGA: 


1404.0 


TCGATCTCCT 


GACCTCGTGA 


TCCGCCCGCC 


14100 


GTGAGCCACT 


GAGCCTGGCC 


TCCAATGGCA 


14160 


AAAAGTTACC 


TCACAGAAAA 


TAGTGCAAGT 


14220 


AAGCACTATA 


GAATAGAGCA 


CCTCGCCCTA 


14280 


CTCTCCACAG 


CTCCTTCTTT 


GTTTTTTTGA 


' 14340 


AGTGCAATGG 


CAAAATCTTG 


GCTCACTGCA 


14400 


CTGCCTCAGC 


CTCCCGAGTA 


GCTGGGACTA 


14460 


CATTTTTGGT 


AGAGACGGGG 


TTTCATCATG 


14520 


CCAGTGATCC 


TCCCACCTTG 


ACCTCCCATA 


14580 


CCTGGCCTCT 


CCACAGCCCC 


TTCTGTGTTG 


14640 


GGCTTGGGTT 


CCCCACTAGT 


GTGAAGTGAG 


14700 


GAAAATATTT 


TATTTTACTT 


T1* T? TIT T T 


14760 


GGAGTGCAAT 


GGCACCATCT 


CCACTCACTG 


14820 


TCCTGCCTCA 


GCCTCCCAAC 


TAGCTGGGAT 


14880 
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TACAAGTGCA CACCACCACG 


CCCGGL I AAL 


111 lolAl 1 1 


TTAGTAGAGA 


CAGGGTTTCA 


14940 




CCGTGTTAGC CAGGCTGCTC 


TCAAALILLI . 


bAL.L 1 LA I on 


TCCGCCCACG 


TCGGGCTCCC 


15000 




AAAGTGGTGG GATTACAGGT 


GTGAGCCATC 


AL.ALL lbbUL 


CAAGAAAATA 


TTTTTAAACT 


lJuDU 




AGTATTCTTG ACCGGCACGG 


TCAACACTGA 


IblnAl lbAA 


ACTGTTGTAT 


TTGAAGTGTT 


i s i 9 n 


5 


AGCAAAGAAA GAGAATTCTG 


GTTCAACAGA 


AAAGTCAGTL 


ACGACTTTTC 


AGTCACGCAT 


1 J 1£3 U 




GAATTACACA GTAACCAAAT 


AGATAACATG 


CCATGACTGA 


CGACGGGCCC ACAACAAATC 


i a r\- 




AGCTCCGACC AACAGGGTCC 


ACACCACCAT 


GGGTCTACAC 


AbAi Lt*Alj<j 1 




i ci nn 
ljj UU 




AGCCTACAGT GACGCGGGCC 


CCTGTGGGGT 


GGTCCCTGCA 




PTPiPaPTflP, 


ICICfl 

IjjoU 




GTCCCAGTGG GGTGATCCCT 


GCGGGTCGCG 


TCCCTGCGAG 






1 C/tTfl 

J. DfiZU 


10 


CCCTGCGGGT CGGGTGCCTG 


CGGGGTGGTC 


C CTATGGGTL. 




PPPTP PPPTP. 


i an 




CCTGCGGGGT GGCCCCTGGG 


AATCGCGTCC 


CTGCGGGTCG 


c p tp_p r* tppp 




Ij D4U 




TGGGGATCGC GTCCCTGCGG GTCGGGTGCC 


TGCGGGGTGG 


pp p» p* mp cc f y\ 


TPPPPTPPPT 


1 C C Art 




GCGGGTCGGG TGCCTGCGGG 


GTGGTH- 1 lb 


TGGGTCGCGT 


C P 1 P* TP TPP P P 


TPPTPPPTPT 


13 0 0 U 




GGGTCGCGTC CCTGTGGGGT 


GGCCCCTGCG 


GGTCGCGTGG 




CCCL'VC CCXCITC 

VjuU 1 V- VJUU 1 O 


±3 / ill 


15 


CCTGCGGGGT GGTCCCTGTG 


GGTCGCGTCC 


CTGCGGGTCG 






1 Q.T Q A 
10 / OU 




TGCGGGTCGC ACCCCTGCGG 


CGTGGTCCCC 


CCGGGATGGG 


TPPIIPPPAPH 


apppppptpp. 


1 CQAfl 




AGGCCGAGCC CGCGCCCGCC 


CGCGGCGCCA 


AGATGGAGGC 


A (~1T1 A APPP.PP 


cx^mc recce 


1 CQArt 

Id? uu 




CCCGCCACCG CCCGCGCCGC 


CCGCCTGACG 


CCGCCGTTGC 






i Caen 




CGGCCGCCCC TCCCCCGGCC 


CTCCCCTCCC 


CCCGCCGTAA 


CGTCCTGACG 


CTCCGCAGGG 


t c r\ o pi 


20 


ACCCCTGACT GGACGGCGGC 


GCGTGAGCGG 


AGCGAGAGGC 


CTCGCCGLGG 


bGGGGCLGLG 


T c n o n 




GGCTCGCCGG CGCCGCTTAC 


CTGGGGCCGC 


GCCGGGCCTG 


CTTAGGCACC 


CGGCGGGGGC 


10 l*tU 




GGCGGCGTCG GGAGCTGCGG 


CGGCGGCGGG 


CGGCGGCGGC 


GGCCGCGGGC 


TTCGCTCCTT 


i c t n n 




GTTGGGGATT CGGCGGCGGC 


GGCGGCGCGG 


GCGCGCGCT1 


CCTAGTGACG 


CAGGCGGCGG 






GGCCGCGCAC GCACGGGGCT 


GGGAGGGCCG 


bAL AU I 1 A 1 1 


TGGCGCTCGC 


■ GGAGGAGGAA 




25 


GGCGGGGCCG TGAAATAAGG 


CCCGACGGGC 




GTGCGCGGAC 


CGACACTGTC 


16380 




AGCTCCTAAC GCCGCAGGTT 


CCTCCTGGTC 


CCCGAGGCCC 


CCGGTCGGGC 


GTTGCCTGCC 


16440 




CCGCGCGGGC GGCCGGGCCG 


AGGGACGATG 


GTCAGTGGAC 


GGACGGCGCC 


AGGGAGCAGT 


16500 




GCCCACGCGC GGCAGGGCGG 


TACCTTCAGG 


CCTCCAGGTA 


CGGGCGCTCC 


TCGCCCGGAC 


16560 




GCTGCTGTGT GTGAATGGGC 


GCGAGGGGAC 


TCCCCTGCGG 


GGCGGACGCC 


TGAACACGAG 


16620 


30 


GCTGTGGAGG AGGACGCTGT 


AGGGTGCGCG 


GACTCACGCG 


GAACATGCCA 


GAGGCTCAGC 


16680 




CAGCCACGGC GCTCCCAGCG 


TGGAGGGCGA 


GGGGCATCCG 


GGAGCGGCCG 


GGAGGGCTCG 


16740 




GTCACCCCTC AAGCTGTCAC 


CCCAGTCCCA 


. CAACCAGCAC 


CCCGATCCTA 


TCGCAGTCCC 


16800 




ACAGCCGACA , CCCCGATCCC 


ACCCCTGCCC 


AACAGCCGGC 


ACCCACCCCA 


ATCCCATAGC 


16860 




TAACACCCCG GTCCCACCGC 


TGTCCCACGG 


CCGGCACCCC 


GATCCCACCC 


■ CAGTCCCGCA 


16920 
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GCTGGCACCC CGATCCCACC CCAGCCCAAC AGCTGGCACC CACCCCGATC CCACCGCTGT 16980 
CCCACAGCCG GCACCCCGAT CCCACCCCAG TCCCGCAGCC GGCACCCCGA TCCCACAGCC 17040 
GGCACTCACC CCGATCGCAT AGCATAGCTG ATACCCCGAT CCCACCCCAG TCCCATAGCC 17100 
AGCACCCCGA TCCCACCCCA GTCCCATAGC CAGCACCTCG ATCCCATAGA TGACACCCCG 171'60 
5 ATCACGCCCC AGTCCTATAG CCCGCACCCC GATCCCACCC GAGTCCCGCA GCCGGCACCC 17220 
CATCCCACCC ATGTCCCACA GTCGGCACCC CGATCCCACT CGGATCCGGC AGCCAGCTTG 17280 
GATCCTGTGG CCCTCCTCCA GCCCCCAGGG CTCATTTATA TGTTTTATTG GCAGAGGCTG 17340 
GGGCTGGCTC TGTTGGCCTC TGTGCTGGGT TTCTTCCTCT GCACCGCAGG ACTGGCTCTC 17400 
CTGACCTCTC CAGGTGTCAT CGAACACCCT TGTGCTTGCT GTCACCCGCT GCCTGTCTGC 17460 
10 ■ AGGATCCCGG ATTCCGTATC AGGGGACCGA AATTAGTCGG AAAATAGGAA GCAGGTGCTC 17520 
GCTTGGATGG AACCCTGACC CTGTGCTCAC ACTTGTAGGA GGAGGGCTCT GCAGGCCGCC 17580- 
TCCCGGAACG GGAGGTTCCC AAGCCACTGC ACTTCGGAGG GGCTGTAATT AGAGTTGCAC 17640 
ATTCATTCAG TTCCCAGTAA AGTAGAACGT GCTCCAGCCA GTGAGGAAAA GGTGTTTTTA 17700 
AAAATTAGAT TGGCCGAGTG CGGTGGCTCA TGCCTTTTAC CTCAACACTT TGGGAGACAA 17760 
15 AGGTGGGAGG ATCACCTGTG GCCAGGAGTT CAAGACCAGC CTGGGCAACA GAGCCTGTCT 17820 
CTGGGGAAGA ATAAAAAAAA AAATTGAGCC TTTGTCAGTG CTACTATTTT ATTATCTGGT 17 880 
AAATATGAGA GGGTTCACGC GGTCTATGTG TGTCATTTAT CTGAGTTTGC CTATCGTCAC 17940 
GTTTTGGAAA TAAATGTCAA TAAAGTCGAA GAGGAGTGCT GAGGGGGGCC TGGGGATGGG 18000 
, AGGGTGGCTA CATCATGCCT GTGTGTTGCG CAAGCCCACC GAGGTCGGCC TGGGGTGAGC 18060 
20 ■ CCTGGGGCCT GTTCTGCCTC CTTCACTCTG GGGCTCCAAG AGACAAACTG GGCAACAAGA 18120 
GAGAAACTCC ATCTAAAAAA AAAGAAAAAT CACCTCCAAG ATAACTTAGC TTTCTTCTGC 18180 
TGGCATAACA AATTATCTCA AACTTAGTCG CTTAAAAATG CAAATTTAGG CTGAGTGCGG 18240 
AGGCTCACGC CCATAATCCT AGCACTTTGG GAGGCCAAGG CAGGATTGCT TGAGGCCAGG 18300 
AGTTCGAGAC CAACATGGCC AGAACTGTCT CTTTTTAAAA AATGCAAATG TGTCCGGCAC 18360 
25 GGTGGCTCAG GCCTATAATC CCAGCACTTT GTGAGGCCAA GGCGGGCAGA TCACGAGGTC 18420 
AGGAGATAGA GACCATCCTG GCTAACACTG TGAAACCCCC TCTCTACTAA AAATACAAAA 18480 
AATTAGCCTG GCGTGGTGGC AGGCGCCTGT AGTCCCAGCT ACTCGGGAGG CTGAGGCAGG 18540 
AGAATGGCGT GAACCCAGGA AGCGGAGCTT GCAGTGAGCC GAGATGGCGC CACTGCACTC 18600 
CAGCCTAGGC AACAGAGCAA GACTCCGTCT CAAAAAATAA ATAAATAAAA CTGCAAATGT 18660 
30. ATTCTCTAAC TGTTCTGTAG GTCGGAAGTC CAGCCCAGCC TCACTCCGCC AAAATCAGGG 18720 
TGTCTGCAGG GCCGATTGCT TTTGGAGCTC CAGGGGAGAA GCTGTTCTGG CCTTTCCAGT 18780 
TTCTGGAAGC ACTTGAGCCC CTTGTCTCGT GGCCTATCCC ACAC CTGAAA GCCAGCCAAA 18840 
GCCAGTTGAG TCCTCACCCT GTTGGCCCCG ACACTGATCT CCTGCCTCCC TCATCTGCTG 18900 
TCAAGGCCCC TTGTGATGAC ATGGGGCCAC CAGCTGGCCC AGGGCACCTC CTGTCAGAGT 18960 
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CCGCCGACCA GTGACCTTCA TTCCATCTGT CGCTGTAATT CCCCTTTGCT TGGAACCAAC 19020 

GTTCACAGAT CCCAGGGGTT AGGATGTGAA TATCTTGGGC AGGGCTGTGG GGGGGCTATT 19080 

CTTCCTTCTA AAATATTTAT CATTTTTGTT TTGGGGATTT TTTTGGTTTG GTTTTTTTTG 19140 

AGACAGAGTC TCGCTCTGTC GCCCAGGTTG GAGTGCAATG GTGCAATCTC AGCTCACTGC 19200 

5 AACCTGTGCC TCCGGGCAGA GGTGAGCCAC TGCACCAGGC CTGTTTTTGT TTTTGTTTGT 19260 

"TTTGTTTTGT TTTTGAGATG GAGTCTCGGC CGGGCGCGGT GGCTCACGCC TGTAATGCCA 19320 

GCACTTTGGG AGGCCGAGGC GGGCGGATCA CGAGGTGAGG AGATCGAGAC CATCCTGGCT 19380 

AACACGGTGA AACCCCGTCT CTACTAAAAA TACAAAAAAT TAGCCGGGCG TGGTAGCGGG 19440 

CGCCTGTAGT CCCAGCTACT CGGGAGGCTG AGGCAGGAGA ATGGCGTGAA* CCCGGGAGGC 19500 

10 GGAGCTTGCA GT.GAGCCGAG ATCGCGCCAC TGCACTCCAG CCTGGGCGAC AGAGCGAGAC 19560 

TCCGTCTCAA AAAAAAAAAA AAAAAAAAAA AAAAAAAGAG ATGGAGTCTC ACTTTGTCAC 19620 

CCAGGCTGGA GTGTAGTGGC GGGATTATAG GTACGCGCCA TCATGCCCAG TTACTTTTTG 19680 

TATTTTTAGT AGAGACAGGG TTTTACCATG TTGGTCAGAC TGGTCTCAAA CTCCTGATCT 19740 

CAGGTAATCC ACCCGCCTCA GCCTCCCAAA GTGCTGGGAT TACAGACGTG AGCCACCGTG 19800 

15 " TCTGGCCATA TTTATTAACT ACAAAGGGAA AGATGATAAT TTTTTTTTTT GAGATGGAGT 19860 

CTCACTCTGT CACCCAGGCT GGAGTACAAT AGCGTGATCT TGGCTCACTG AAACCTCTGC 19920 

CTCCCAGGTT CAAGCGATTC TCCTGCCTCA GCCTCCCAAC TAGCTGGGAT TACAGGCGCA 19980 

CGCTACCAAG CCCAGCTAAT TTTTGTATTT TTAGTAGAAA CGGAGTTTCA CCATGTTGGT 20040 

GAGGCTGGTC TCGAACTCCT' GACCTTGTGA TCTGCCCACC TCGGCCTCCC AAAGTGCTGG 20100 

20 GATTATAGGC ATGAGCCACT GCAACCGGCT GAAAGATGGT AATTTTAAAG TAGAGAAACT 20160 

GGGTTGGCTG GGCATGGTGG CTTATGCCTG TAAGCTCAGC ACTTTGGAAG TCCAAGGCAA 20220 

GAGGATCGCT TGAGTCCAGG AGTTTGAGAC CAGCCTGGAC AATATAGCAA GACCCCATCT 2028 0 

CCGCAAAAGC TAAAAAGTTA GCCAGGTGTG GCGGCACATG CCTGTAGTCC CAGCTACTCA 20340 

GGAGGCTGAG GTGGGAGGAT CACTTGAGAC CAGGAGGTCA AGGCTGAAGT GAGCTGTTAT 20400 

25 TGTGCCACTG CACTCAGCCT GGGCAACAGA GCGAGAGTCT GTCTCCAAAG GTAAAAAAAG 20460 

GTCCAGGCAC AGTGGCTCAC ACCTGTAATC TCAGCACTTT GGGAGGCCGA GGCGGGCAGA 20520 

TTCGTTGAGG TCAGGAGTTC AAAACGAGCC TGGCTAAATG GTGAAACCCC GTCTCTACTA 20580 

AAAATACAAA AAAATTAGCC AGGCATGGTG ACGGGCGCCT GTAATCTCAG CTACTTGGGA 20640 

GACTGAGGCA GGAGAATCAT GTAAACCCAG GAGGCTGAGG TTGCAGCGAG CCAAGATCAT 20700 

30 GCCACTGCAC TTCAGCCTGG GCGACAGAGC AAGACTGTCT CAAAACAAAA CAAAAGAATC 20760 

TTGAGTCCTG AGTTCCTCTA AGGGAAATTC CAGGCACCTC GCCAGCCTTG ACAGGCAAAG 20820 

GAACAATCTG ATGAGGAAGA AGATAGAAAC AGCTTAAACA ATAGTCTCCC GGCCGGGGGC 20880 

AGTGGCTCAC GCCTGTAATC TGAGCACTTT GGGAGGCCGA GGCGGGTGGA TCACAAGGTC 20940 

AAGAGATCAA GACCATCCTG GCTAACATGG TGAAACCCCG TCTCTACTAA AAATACAAAA 21000 
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AATTAGCCGG 


GCGTGGTGGT < 


GGGTGCCTGT 


AGTCCCAGCT 


a p mp rrirz A c c 




21060 


AGAATGGCGT 


GAACCCAGGA 


GGCGGAGCTT 


TCAGTGAGCT 


LjAvjAILvjCvjL 






CAGCCTGGGC 


GACAGAGCCT 


CGAGACTCCA 


TCTCAAAAAA 


aaaaaaaaa'p 






TGGTGGCTCA 


CACCTGTAAT 


CCCAGCTACG 


TGGCAGGCTG 


AGGG AGG ALj A 


Arpppprpmpj\ ?! 


0 1 9 A Pt 


CCTGGGAGGC 


GGAGGTTGTA 


GGGAGCTGAG 


ATCGCACCAC 


TGLAL 1GGAG 


GG J, GGGG nAL 


n r.T p\Pi 


AGAGCGAGAC 


TCTGTCTCAA 


AAAAAAAAAA 


AAAAAACAAA 


AAAAC AAT AG 


TGTCGCAAGT 


1 1 1 c n 


AAGTCAGAGT 


CACAAGGTGT 


TTTGATTCCC 


TGTGGAAACT 


AAAATATAAC 


AGC TT AAC AT 




ATGTTCTTGA 


GTTATTTTTC 


AGAAACTTGG 


ACATCCACCA 


fT*TT\f^f* A A A A T* 

G GTGG AAAAT 


GC.TvjAGt.TAG 


*)i /inn 


GAACAGTGGC TATAATTTCA GCCTTTTGAG AGGCCAAGGT 


GGAAGGATL A . 


GTTGAGGGGA 


oi c/in 


GGAGTTAGAG 


ACCAGCCTGG 


CCAACATGGT 


GAAACCCCGT 


/-i m/™" m A i*"**T* TV TV A 

C TC T AGT AAA 


AAT AG AAA I A 


oi c nn 
Z -L □ u u 


TTAGCTGGGC 


ATGGTGGTGC 


AACCTGAAAT 


CCCAGCTACT 


TGGGAGACG1 


A /T"TVT' P" 1\ /T* 

AGL IGGGAGG 


z lo o u 


ATCGCTTGAA 


CCTGGTAGGA 


GGAGTTTGCA 


GTGAGCTGAA 


ATTGTGCCAC 


TGGAGTLTAG 


zl / Z U- 


CCTGGGCAAC 


AGAGTGAGAC 


TCTGTCTCAA 


AAAATAAATA 


AATAAAAAGA 


GAAAAAAGTG 


Zl /oU 


TTGCCTGCAG 


GCCGGGCACA 


GTGGCTCACG 


CCTGTAATCL 


CAACACTTTG 


GG AGGC C GAG 


T 1 o a r\ 


ATGGGCAGAT 


CACCTGAGGT 


CAGGAGTGCA 


AGAACAGGG 1 


GGCCAACATG 


/too a A a /""O Of* 

GTGAAACCGC 


ziyuu 


ATCTCTACTA 


AAAATACAAA 


AGTTAGCTGG 


GTGTG I ALA1 


GTAGTCTCAG 


CTACTTGGGA 


zxy bu 


AGCTGAGGCA 


GGAGAATCTC 


TTCAACCGGG 


GAGGTGGAGG 


TTGCGATGAG 


CTGAGATCAC 


Zz U 


GCCACCACAC 


TCCATCCAGC 


CTGGGTGACA 


GAGTGAGACT 


CCATCTCAAA 


GCAAAAAAAG 


zz UoU 


AAACATAGGT 


GGGACCCTTG 


GTGTGTCCTT 


AGGGCATGAT 


GGTTGAGGTA 


TACTGGTGGi 


Z214U 


CCTGTCATGT 


AAAAGAAAAC 


GAGCCGACTC 


TGTGTCTACT 


GGAGAAAGCA 


CTGCATATAT 


zzZUU 


CAGCCACAGT 


CAATACCTCG 


CTTCTGCAGG 


GACGGTGGCT 


GCCAGAGTGG 


GAGGGTTTGG 


1 o O C Pi 

ZzZ bu 


TAGCACCCAT 


GTCGTGGAAT 


CACAATGTTG 


TCGATAGCTC 


TGGGGTCTTG 


m A f~* A A A A •"TV" 

TACAAAATGG 


Zz j ZU 


CAGATCCTCC 


CATTTGGTTT 


CCTTATGGGA 


AGGATCGCAG 


rn a pfliTi rp a a rn a 

TACTATAA 1 A 


GAlGoGLl Hj 


Z Z J 0 u 


TGCAAGGGAT 


CATTATACCC 


TTTTCTCTTT 


TTTTGCTTTT 


L L 1 IGAijAv-A 


papTTTraPT 


z z y u 


CTCGTCACCC 


AGGCTGGAGT 


GCAATGGCGC 


GATCTTGGCT 


UAL. iVjl-AALL. 


rn r» p a p p Tr*P T 




GGGTTCAAGT 


GATTTTCCTG 


GCTCAGCCTT 


CTGAGTAGCT 


GGGATTACAC 


ATGCCCGCCA 


22560 


CCAGGCCTGA 


CTTATTTTTG 


TATTTTTAGT 


AGAGACAGGG 


TTTCACCAAG 


TTGGTCAGGC 


22620 


TGGTCTTGAA 


CTCCTGACCT 


CAGGTGATCC 


ACCCACCTCG 


GCCTCCCAAA 


GTGTTGGGAT 


22680 


TTCAGGCATA 


AGCCACCAGG 


CCCAGCCTTT 


CTTTCTTTTT 


AAAATTAATC 


TTTGTTTAAA 


22740 


AATACTCTCA 


TTTTTTATTT 


AATTGTAGCA 


CTCCTAGATC 


CCGAAAGCAG 


ATACACTCTT 


22300 


GTTATGGGTC 


TGATTCTTTT 


CATTGCTTCA 


CGCCTTAGAG 


GATATTGTCC 


AATACTGGAT 


22860- 


AAAAGTTTAC 


TCAGGTCTAC 


TTCCACTTTA 


. ACGGGGATGG 


CTGAATATCT 


CTTCCACTTG 


22920 


GCTGTTTGTT 


1 TATAATGAAC 


TGACAAACAT 


1 ACAAATTTTC 


TTGAGTTCTG 


TGAGACATTC 


22980 


TAGTAAATCA TCTAACCTGA 


. AGAGCAGGTT. GTGAGAACCC 


CTGATTTAGA 


AAGCCCAGTG 


23040 
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GTCATAAATA 


TAAGTGGCTC 


TGGACTCbC 1 


Uooob 1 V— . i. 


Onrtw 1 \Jl \3\J\3 


P APTP PP.TT A 






GGATTGAGCC 


CTTGTAATTT 


GTAGGATCTG 


ACACACACTC 


P A CCZ A A CZCZC A 


PTP.T P A P 71 A T 


1 1 fin 




TTACCTGTAT 


TATATTGGAC 


ACCCAGTTAG 


CGTTTGGAGA 


All VjVj 1 1 \J\- 1 


PPT A TAP A A A 


4 J £ Z U 




AATACCAAAT 


ATTTTATGTC 


AGGGGAGTGA 


AAGAAAAAAC 


AAA AArTTHH 


U-UbbbCbL.Ljb ■ 




5 


TGGCTCACGC 


CTGTCATCCC 


AGCACTTTGG 


GAGGCCGAGA 


C CbbC A I C 


ACbAbb iCAb 


6 J J <tU 




GAGATCGAGA 


CCATCCTGGC 


TAACAGGGTG 


AAACCCCATC 


mnmn /"*m7\ a a a 

T C T AC T AAAA 


AT AC AAAAAT 


4 J4UU 




TAGCCGGGCG 


TGGTGGCGCG 


CGCCTGTAGT 


CCCAGCTACT 


CGGGAGbC l\j 


AGGCAGGAGA 


^ J 4 bu 




ATGGCGTGAA 


CCCGGGAGGC. 


GGAGCTTGCA 


GTGAGCCCAG 


ATCGCGCCAC 


C GC AC TC C AG 


1 Jb^U 




CCTGGGCGAC 


AGAGCGAGAC 


TCCGTCTCAA 


AAAAAAAAAA 


PT\ A A A ?l ^ S A A 

CAAAAAAAAA 


AAACAAAAAA 


n-jCQn 


10 


AAAAAACCCA TACACTTTAA GGAAAGCAAC TGACAGCATT 


TGTT AL C ACT 


C* AT* A A A ft T"T lr P 

GA1AAAATTT 






GAGCTTTGAA GTAAGAATAA CAATTTTGCC 


7v mrpr^'VC' f* 1 r* 1 
Al 


GGC C AAG AAA 


AAAAAAAGAA 


z J / UU 




.TTTTGCCATT 


GTGAAAGGCT 


TCCCAGTACT 


r V r VC r VC A TP A CI 


CTTGACGGTG 


ATATTAACAA 


IS /bU 




ATAACTTTTT 




TTGAGATGGG 


GTCTTGCTCT 


GTCACCCAGG 


CTGGAGTGCA 


O "3 Q O Pi 




GTGGTTCAAT 


CTCAGCTCAC 


TGCAACCTCC 


GCCTCCCAGG 


TTC AAGC GAT 


TCTCCTGCCT 


noon 
i J00U 


15 


CAACGTCCCA 


AGTCGCTGGA 


CTACAGGTGT 


GCGCCACCAC 


GTCCAGATAA 


TTTTTGT ATT 


T "3 Q A A 




TTTAGTAGAG 


ATGGGGTTTC 


ACCATGTTGC 


CCAGACTGGT 


CTCAAACTCG 


TGACCTCAGG 


24000 




CGACCCGCCC 


ACCTCGGCCT 


CCCAAAGGTG 


GGAGGCCTTG 


CTGGGATTAG 


AGGTATGAGC 


O A A C Pi 

J 4UbU 




CGCTGCACCT 


GGCCTCTTGT 


CCTTGTGTTT 


TGCAGTGATG 


C AATGAC C AT 


GTCT 1 AC AT 1 


O A 1 I Pi 




TGCAACCAGA 


AAAAAAGGTT 


AGTGTAACAA 


TGTTTATCCT 


GTTTTTCCCA 


t\ /^^^ tv /*■*' tv tv fry 

GAGTAGACAT . 


1v) 1 OA 

241o0 


20 


TATGAAGATT 


AAAAAAATTT 


GAAAGTGTTT 


TGAATATAAT 


AAACTATGCT 


AT AC AC AC AA 


Z 424U 




CATTTTGGTG 


ACTAGAAATA 


CAAGTTTATT 


GTTTGTTGTT 


T GTT GAG AC A 


ppnppp | pr , r"i< 

GGGCCCTGCT 


1 /l 1 A A 

Z4 jUu 




CTGTCTCCCA 


GGCTGGGTGG 


CACAATCATG 


GCTCACTACA 


GTCTTGAACT 


CCTCjGGCTTA 


0 /I 1 C A 

Z4 J b u 




AGCGATCCTC 


CCACCTCAGC 


CTCCAGAGTA 


GCTGGGACTG 


L. AAAL. 0 AVjV_ A 








TGGCTAATAT 


TTGTATTTTT 


TGTAGAGATG 


GGGTTTCACC 


ATGTTGCCCA 


GACTGGTCTC 


^ fi O U 


25 


AAACTCCTGG 


GCTCAAGCAA 


TGCTCCTGCC 


TCGGCCTCCC 


AAAGTGCTGG 


GATCACAAGT 


9 a c;yi a 




ATGAGCCACT 


GCACCCGGCT 


GAGTTTCTGT 


TGTTTTAAGC 


CGCTTCATTT 


GTGGTACTTC 


24 600 




TTACAGCAGT 


CCCAGGAAAC 


TGAGCAACTG 


CAGAACATCA 


AAATTGTTTT 


TCTTCAGCAA 


24660 




AAGGAGAAGC 


ACTTGTGGTT 


GGCACCAGCT 


TTTCCTGTGC 


TCACTTCTGC 


ATGGCCGCAC 


24720 




CTTTGCCCGA 


CACGAGTGCA 


CAGCAGGCTG 


TGGGGGAGCA 


ACTGGTTGAG 


TCAGGCCTCC 


24780 


30 


ACTTGTGCCG 


TATCCCCACC 


TGCTTTGCTG 


GACACGCCTG 


TTTGGGGGGC 


ACCCACTGCT 


24840 




GCCCCAGACA CCAAGCAAGC ACCAGCTGTG TCCAAAACTT 


ACAGTCACTG 


TCTTGGCCCG 


24900 




TTTTGTGCTG 


CTGTAACAGA 


ATGCCACAGA 


CTGGGTAATT 


TAATACAGAA. CAGAAATTTA 


24960 




TTTCCTCAAA 


GTTTTGGAGG 


CTGGGAAGTC 


CAAGAGCAAG 


GGGCCATCAG 


GTCAGGGCCT 


25020 




GGTCTCTGCT 


TCCACGATGG 


CACCTTGACC 


ACCGTGTCCT 


■ CACGTGGTCA 


GAGAGAGCCC 


25080 
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ACTCCCAGGA 


GCCCTTTTAA 


1 Ab ivaL AbAA 




CTGCGGTTAA 


GTTTCCAACA 


u J J. *i \J 




CGTGAACTTC 


GGAGGTGACA 


LATILAbAlL 




CTCTAGGCAG 


AGTGTCTGAT 


25200 




GTGGTTTTAA 


AATACGTTCA 


CAGACTGGCC 


GGGCACTGTA 


GCTCACGTCT GTAATCCCAA 






CAGTTTGGGA 


GGCCAAGGTG 


GGTGGATCAC 


CTGAGGTCAG 


GAGTTCAAGA 


CCAGCCTCAC 




5 


CAACATGGTG 


AAACCCCATC 


TCTACTAAAA 


ATACAAAATT 


AGCCAGGTGG 


TGCATGCCTG 


7 ^flO, 
4 jJuU 




TAATCCCAGC 


TACTCGGGAG 


GCCGAGGCTG 


GAGAATCGCT 


TGAATCCAGG 


AGGTGGAGGT 


Z.D44U 




TACAGTGAGT 


CGAGATCATG 


CCATTGCACT 


CCAGCCTGGG 


CAACAAGAGC 


GAAACTCTGT 


ZjjUU 




CTCAAAAAAT 


AAAATAAAAT 


AAAATACATT 


CACAAGGCCG 


GGCACTGTGG 


CTCACGCCTG 


Z D3oU 




TAATCCCAGC 


TACTTGGGAG 


ACTGAGGCAG 


GAGAATCGCT 


TATAACCTGG 


GAGGTGGAGG 


Z jOZU 


10 


TTGCAGTGAG 


CTGAGATCAC 


ACCGCTACAC 


TCTAGCTTGG 


GCAACAAGAG 


TGAAACTCCG 


£ JDOU 




TCTCAAAAAA 


GTAAAATAAG 


GCCCTGCAGG CATGGTGGCC 


CACACCTGTA 


ATCCCAGCAC 


6 J / 4U 




TTTAGGAGGC 


CAAGGCGGTC 


GGATCACGAG 


GTCAGGAGTT 


CGAGACCAGC 


CTGGCCAACA 






TGATGAAACC 


CCGTCTCTAC 


TAGCCTAGCC 


AACATGGGGA 


AACCCTGTCT 


CTACTAAAAA 






TACAAAAATT 


AGCCGGGCAT 


GGTGGTGCGT 


GCCTGTAATC 


CCAGCTACTC 


AGGAGGCTGA 


Z Dy ZU 


15 


GGCAGGAGAA 


TCGCTTGAAC 


CCAGGAAGCA 


GAGGGTGCAG 


TGAGCCAAGA 


TTGCGCCGCT 


z jyou 




GCTCTCTAGC 


CTGGGCGACA 


GAGCGAGACT 


CCATCTCTAA 


ATAAATAAAT 


AAAATAAGAA 


Z 




AATAAAATAT 


GTTCACAAAT 


CCTTTGACAT 


TCCTCACCTC 


AAAAGCTGGA 


ACCCAACTCC 


ibiuu 




CTCCTAAGCA 


TGAGTCTTCT 


CAGTGACTCA 


CTTCTAACAG 


CAGAACTTAC 


ATGGTTCCCC 


Z OlbU 




ACACCCAGAG 


GACATTGGGT 


TCCTCCCAAT 


ATCCCCCCAC 


CCAGCGACCC 


CCACCCAGGT 


Z 0 Z Z U 


20 


CGCTGGCTTT 


GGGTCCCCCA 


GAGCCATGTT 


TCAAGGACAC 


TCAGGCAGCC 


CCTGGATGTC 


• Z 0 Z O U 




CATGTGGTAA 


GGAATGAAGG 


CCTCCTGCCT 


GCAGCCTCGG 


GAGGGAGCAT 


TCTCAGAAGA 


OCT jn 

Z 0 j 4 u 




GGATGCCCCA CCTCCTGCCC AGCCTTCAGA TGGCCAGGAC 


CTCGTCCAAC 


GTCCTGACTG 


Z DfiU Lf 




CAACATCATG 


AGAGACTCCG 


AGCCAGAAAC 


CCCCAGGTTT 






Z O *± O v 




ACTGACAGAT 


AATGTTCGTT 


GTTAATTAAG 


GGGTGACTTG 


TCACACACAA 


TAGGTCACTA 


2 6520 


25 


AACAGCTCTG 


TCTGGCCTCC 


CAGGAGGAGC 


CTGCCTTTCC 


TTTTCTTCAT 


GGGAAAAGTG 


2 6580 




CGATCAGTTT 


GTGAAGGAAT 


GTCCGCCCCC 


ACTTGATGCC 


AGAGGCTCCA 


CATGGTGACT 


26640 




GTCATAAACT 


CCATCTGCCC 


TCAGTGCCTT 


GCCAGCACCC 


GGCCTGCGAT 


CAGCTTGGTC 


26700 




TTGCGGGAGG 


CCAAGGCCCA CGTGTGTTTG 


TGTGTGGTGT 


CTGTGTCTGC 


GTGCCCATGC 


26760 




ATGCCCAGGG 


TACAGGGATG 


CCATATACAA 


ATTCTTTCAA 


TGTTGTATGT 


GGCATGTGTG 


26820 


30 


TGTCTGTATG 


CCCAGGATAC 


AGGGATGCTA 


TATACAAACT 


CTGTTTTTTC 


GTTTTTTTTT 


26880 




TTTTGAGACA 


GAGTCTTGCT 


GTTTCGCCCA 


GGCCGGACTG 


CAGTGGCGCT 


ATCTCGGCTC 


26940 




ACTGCAAGCT 


CCACCTCCCG 


GGTTCACGCC 


ATCCTCCTGC 


CTCAGCCTCC 


TGAGTAGCTG 


27000 




GAACTACAGG 


CGCCCGCCAC 


CACACCCGGC 


TAATTTTTTG 


TATTTTTAGT 


AGAGACGGGG 


27060 




TTTCAGCATG 


' TTAGCCAGGA 


. TGGTCTTGAT 


CTCCTGACCT 


CGTGATCCAC 


CCGCCTCAGC 


27120 
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CTCCCAAAGT GCTGGGATTA CAGGCATGAG CCACCACGCC TGGCCTACAA ACTCTTTCTT 27180 

TTTTTTTTTT TTTTTTTTGA GATGGAGTCT CACTGTCTTC CAGGCTGGAG TGCAGTGATG 27240 

CGATCTCAGC TCACTGCAAG CTCCACCTCC CGGGTTCATG CCATTCTCCT ' GCCTCAGCCT 27300 

CCCAAGTAGC TGGGACTACA GGCACACACC ACCACGCCCA GCTAATTTTT TGTGTTTTTA 273.60 

5 GCAGAGATGG GGTTTCACCA TGTTAGGCAG GATGGTCTCG ATCTCCTGAC CTCGTGATCC 27420 

GCCCGCCTCG GCCTCCCAAA GTGCTGGGAT TACAGGCGTG AGCCACTGCG CCCAGCCTGC 27480 

AAACTCTTT.C AATGTCTTTC TTTTCTCTCT CCTGCCATCT TCTCCCTTGC AGATTTCTTT 2754 0 

TGTCTCTACG TCTTCCCCAG CTGAGTCCGA GGTCCTGACT TGCCCACGCT CCCTGGACTG 27600 

GAGGAGAGGT GATAGCAAGA GCTCCTTCAA GCCCAGGAAT GCCACCAGGG CTGCCCCGGG 27 660 

10 AGAGGAGGAA. GCTGGGTCTC TCGGGGTTGT GGGGACCAGA CACCCTTCTA AGACATGGAC 27720 

TCAGCACAGA AAGTCTAGAC ATCCACTACA AACACATCTC CCTCCTAACA GGGGGCCCCT 27780 

GGGCACCCCA AGTGGCTGTT TGGTGGGACA GGCATGTCCA TCAGTCAGAA TATCTTTATT 27840 

TTTTATTTTT TATTTTTTAT TTTTGAGAGA GTTTCACTGG AGTGCAATGG CACGATCTCA 27900- 

GCTCCCTACA ACCTCCGCCT CCCAGGTTCA AGCGATTCTC CTGCCTCAGC CTGCCACGTA 27960 

15 GCTGGGATTA CAGGTGTGAG CCACCACACC CAGCTAATTT TTTTTTTTTT TTTTTGAGAT 28020 

GGAGTCTCGA GGCTCTGTCG CCCAGGCTGG AGTGCAGAGG CGCGATCTCA GCTCACTGAA 28080 

AGCTCCGCCT CCTGGGTTCA CGCCATTCTC CTGCCTCAGC CTCCCGAGTA GCTGGGATTA 28140 

_ CAGGCATGAG CCACCGCGCC CGGCCAATTT TGTATTTTTA GTAGAGACAG GGTTTCACCA 28200 

TGTTGGTCAG GCTGGTCTTG AACTCCTGAC CTCAGGTGAT CCACCTCCCT CGGCCTCCCA 28260 

20 AAGTGCTGGG ATTACAGGCC TGAGCCACCA CGCCCAGCCC AGAATGTCTT CTTACTTTTT 28320 

ATTACTCTGT CCCCCATCCT GGGTCCAGAC CTGTGACCGT GAACAACCGG CTGCCCAGGG 28380 

GTGAATGGGG TGAGTGGGGT GAGTCCACAG AACAGTGGGG TGCAGCCCCA ■ GGGGTCTCGT 28440 

AGCACCTGCC CCCAGGTCAG GAAGTCCCAC AGCCTAGAGG CTCCAGCCTC AGATGCATAC 28500 

ATATGTAGGC CCTGCCCTTT CCTCCTGAGC GGCGGGCCAC AGAGTCCTGA ACAACAGGAA 28560 

25 GCCCCTGAGG AGGGCTCCGC CCTGAGGGAG GGCAGGGGAG CCCCCGCCAG CCCCACCCAC 28620 

AGCAGCGGGC CCTGCCACCC CCCACCCTGA CACCTCACCC CTTGGATTCC AGAGAGGAAA 28680 

GTGGGCTTGT GTGTAGTTTA CATGCTCATA TCTTAAAATC ACCGTTGTCA ATAGAACAAT 28740 

TCATAATAAT GATGATAAAA TAAGATTTAT AACCAGCTTC AGTCTGGAGA TACACACAGA 28800 

GCAGATCTTC ACTCCCAGAC AGGGAGCCCG CAGCTGCCCC CGACCCCACA GGTGCAGGAC 288.60 

30 ACACACAGAC AGTTCAACCA TGTCTTAAAC ACACAGGTGT TTATTTAATT GTTCATTTGA 28920 

TTGAATTTTT AAGTTCACTT TACTACGTGG ATGAGATGGG TGCATATTAC AGTAGGCTTT 28980 

CGCTATGAGC GCTGCCACCA TGAGGAATAT CCCAGCCCTC AGTTCTGCTT CCCTTTCTGA 29040 

GTCCCACAAA AGCCAGATGT GGACAGCCTT GGGTTCCCAT CCCAGCTGGC TGCTCCTTCT 29100 

GGGGCTGTCT TGGTGGGGAG AGGGAGATGG GGCAGTGGGT CCCTGCTGAC CCCTGAGCCC 29160 
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TGCAGGGGTC 


AGGATCCTCC 


CGTGGTCCCT 


GGGTGTGGCT 




^— 1 ovsCftAj J. 






CCGGCCAAGG 


CCTCCCGCAG 


GATGGAAGTT 


GAGGGCCCTG 


LL 1 L 1 Loo 1 L 


PTA ap a p a ap 
L I AALr AL AAL 


0 QO Q Pi 

z y z 0 u 




TCAGCCGCCC 


CCTTCACACT 


TTACAGCAAG 


GGGCCAGGCA 


LLALL I 1 ILL 


P AT'PPPPPTUT 1 

LAI LLLLL 1 1 


1 Q i yi n 




CCGTGGAGAA 


GTGGGGGATG 


CTGCAGTGGT 


ACAAAGACAG 


LLTLLLLLAL 


LLLLATLL TL 


z y 4 u u 


5 


CAGCTGACCG 


TCCTCCAAGG 


CCAGCACTGG 


GCGTCCAAGG 


P"» A A A P"* A AOOA 

GAAAGAAGGA 


a pt^p appppt\ 

altlalllla 


zy4oU - 




GAGGGTGTGG 


GCAGGAGAGG 


CCTGGAGTCA GGCCTCCACC 


CACAGCCCCC 


TGTGGGTGCC 


29520 




AAGTGGGAAG 


GGTGTTGGGG 


CTGGCTTLGL 


TV APT TT 1 A iT* 

AALLi 1ALLL 


GCTGCCCTTC 


CAACACCTGG 


29580 




ATCTGTGGGC 


AGCGGTC.CCA 


CAAAATCCCC 


LTTGGLlL 1 L 


CCTGAGGAGG 


ACTTGTGGGT 


29640 




GCCGCTTCCA 


CCAGGGCAGA 


GGGCACAGGA 


GGGLLLALLA 


CTCCAAAGGG 


prnp<rm\ r>rr\f*< 

C T CT ALGGTG 


2y /uu 


10 


GGTCTTTCAA 


GGACATCTGC 


AAAGCCCTGG 


TGGLLALLLL 


CCTGGGCCAG 


AGGCTLTTTG 


2976 0 




GAACTCTTGC 


ACTTCTGAGT 


GGGGGACTGT 


CCATLL ILLL 


CACAACCTCT 


AGACCATGCA 


2982 0 




GCCTGCTCAT 


GGGTCCCTGG 


CAGAGAATGC 


ppT\ /""""PP"" PTT"* A 

LCAL 1LLLLA 


GCAGACTCAG 


GGlAGGllll 


29 oo 0 




GAACTGCAGG 


CTTCCAGGAA 


GGCCCAGGGT 


GTC CALL TLA 


CGCCAGGTGG 


TCTCAGAGGA 


29940 




•CCCCTGTGCA 


ACCACATTAA 


GGAAAGCTGC 


AGLLLLLALL 


CACCCGCCTG 


CCAGTTCAAC 


30000 


15 


AAGCACCGGC 


TGCACACGCA 


GGCTCCCAGG 


P^ A P* A fPP A Pr* 1 

CALL A 1 L ALL 


CCCCTCCCCC 


GTCGCCCCTC 


30060 




CCTCACGGGG 


AGCCCCTTCC 


CCCTGGAAAG 


AC ALL ALL I A 


CTGTAGCCTC 


GCCTGCTGGC 


3012 0 




CAGGGGCGCC 


GGCTCAGAGG 


AClTGulLTL 


ALL i. LL AL L i. 


GCTGACCAGA 


CAGCCCAGCG 


30180 




TAAGGACCCG 


CGATCCCACG 


LUACLGCLL 1 


LLL1 1 1ACLA 


CGGTCACCAC 


CACCTCTCTC 


3024 0 




ACAGGGCCCC 


CGGGGGACCC 


ALLLLLLLLL 


pppptpptpt 


CTGCACCGAG 


GGACCGCGTC 


30300 


20 


TCACGCCCGG 


CGGCTCCTGC 


AGGGGAAGCC 


GTGGTCAGCG 


ACTCACCACG 


AGGACAGGGC 


' 30360 




AGGGCGGCTG 


AGTGCGGAAG 


AGAAGCATGA 


AGCTGGGGGC 


GGGGGTGGGG 


G AG G AGG AAC 


3042 0 




AAAAGTTGCA 


TCTAGACAGA 


GGTGAACGAA 


ACAAAACCAA 


AACCCGAACG 


TGTTCCGTCG 


30480 




CAGGATGGGC 


GCCGCCCGTC 


CCGGGCCCTT 


AGCCCGACAT 


CTCTTL1LLL 


ILL 1LLTT.LI 






TCCTGCGCAC 


CTCGGCCGCG 


TGCAGCTCCT 


GCAGGACAGG 


LLLLL LLALL 


LLL 1 LALLLL 




25 


GGGGGTGGCT 


TGGGGCGACT 


CCGGGAACCC 


CCAGGCGCGC 


a nf^rr* p tp pp 
ALLL L LI LLL 


LLLL i LLLAL 






CCGCCCGGCC 


TCATCCGGGC 


TGGCCTTCGG 


CAGGACCCTG 


ACTGAGTTGA 


GGGGGCGGGA 


30720 




GCACCGGGGA 


GGCGCAGAGC 


AAGGCCAGGG 


ACCAAGGACG 


GGTTTCCTGG 


GAGCTGGCTG 


30780 




GGCCCCGCTT CTAGCTCGTA 


CCGGAGCCGA 


GCTTCCTTCA 


GGGCACTTTC 


AATATAATGA 


30840 




ATTTAGCCAT 


CTATTACTGC 


GGCTAGTTAC 


TGTCCCGCCA 


GG AC CAGACT 


CTGGACCTGC. 


30900 


30 


CTCGTGCGCT 


GCTGGGGACG 


CCCAGTAAAC 


ACGGGAGGAG 


CCCCCGACCC 


CCACCCCAGC 


30960 




TCAGCGCCTC 


GGAGTCCCCG 


GCCCCGCTCT 


GCGCCCCTCC 


GAGCTCCGCC 


CTAGCCCCGC 


3102.0 




CCCCGCCCAG 


TGCCCCGCCC 


CCTGCCTGCT 


GCTAGCCCTG 


CCCCCGCCCC 


GGCCCCTGCC 


31080 




CGCTCCGAGC 


TCCGCCCTGG 


CCCCGCCCCG 


GCCCCTGCCC 


GCTCCGAGCT 


CCGCCCTGGC 


31140 




CCCGCCCCCC 


GCCCAGTGCC 


CCGCCCCCTG 


CCTGCTGCTA GCCCTGCCCC 


CGCCCCGGCC 


31200 
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CCTGCCCGCT 


CCGAGCTCCG 


CCCCbbLLbb 


cnr* c c err 1 c P 
bb b b b bbl b ^ 










CCTGGCCCCG 


CCCCCGCCCA 


GTGCCCCbbb 


rrrTp 7\ ever* 
LLL 1 bAb 1 bb 






"11 TTA 

J 1 J4 y 




CGGCCCCTGC 


CCGCTCCGAG 


CTCCGCCCCG 


^'P , p 1 P , p , fT > ^'^ , p' , 
Gbbbbbbbbb 


bbbbbb 1 btL 


rp/"?pppsrp 
bbb 1 bbb Abb 


JlJoU 




TCCGCCCCGG 


CCCCGCCCCG 


GCCCCTGCCC 


bblbbbAbb 1 






1 1 a a n 


5 


CCCCTGCCCG 


CTCCGAGCTC 


CGCCCCGGCC 


CbGbbbbbbb 


ALL 1 ltl bbb 


bb Abb bbb 1 b 


■) 1 Cflfl 




GCGCAGTGCG 


GCCAGGTGTG 


CCTCGCGGAT 


CTCCTTGCTG 


AGCTCCATbT 


TGTAbTTGAG 


J lobU 




CTTCTCCTCC 


GCCTGGCGGC 


TGAAGTTGTT 


ATTCTCCTCC 


AGCGCCTTGT 


GCAGbAbbTb 


Jlo2U 




GCGCTCGTGC 


TCGCGCCGCT 


CCGCCAGCTG 


CTTCAGCACC 


TGCGCCTCCT 


GCGTCTGTGC 


J168Q 




GGGGCCGGCG 


GGCGCGCGTG 


AGCGGCAACC 


CCGGGCCCTG 


CCCGGCCGGA 


CTCCTCCCTG 


"3 1 1 A Pt 

J 1 / 4U 


10 


CTCTCCGCCT 


CCCGCCCAGC. 


GCCCGCTCGC 


CTCACCTGGC 


GCCTCCACCT 


GCCCAGGCCT 


31800 




CGGTGGGCGC 


CGGGACCCCC 


GGGCGCTGCC 


CTGGGAACCC 


TCGCCTGCCA 


TCCGGCCTGT 


J looU • 




GGTCGGGGCA 


GGGCCAGGGG 


GTCGCGATCC 


GCCGCCCCCG 


CCCCCGTCCC 


TGCCTCGCGC 


j l y z o 




GCGGGTCCCG 


CGGTCCTGGC 


TGCGCCCAGG 


GCCCCCGCCA 


TACCCTGCCG 


CCACTGCACA 






CCCTGCCCTG 


CGCGTCTGCC 


CCTCCAAGGA 


CCAGCAGCAA 


GAAACCCTAA 


ACTTGTGGGC 


32040 


15 


GGTCTCTGAG 


CTTTGTCTCT 


TCCTCGGACA 


TCCGCCCACT 


GAGCAGAGTA 


GCTGCTTGTT 


32100 




ACACACCGGG 


TTCCCAGCTC 


CCAATTAGGT 


GCCCAGGAGC 


GGAGGGTCCC 


CAGGGATGCT 


321oQ 




GGGGGAGGGG 


CGGGCTGGTG 


ACCCCTGGGA GGAGAGCGGG GCAGCAGGAC CCGCACCCAC 


32220 




ATGCCAGTCC 


CTACTAGTCA 


GCCCTGTGAA 


CCCTGGTCTC 


TGGCCTCACC 


GGGAAGGGAA 


32280 




CGGAGCCGCT 


TCCCCTGCCC 


AATGCGTTGG 


CCTCCAGGGT 


GGCACCCCCA 


AAAGGACATT 


32340 


20 


' TTTATCTCTG 


TTTCAGTCTC 


AGAGGGGCTG 


GTGGGAGGGG 


AGGCTGCAGG 


GAGGGGACCT 


"1 "i A Pt Pi 




GGAGCCCACA 


CCCACCTCTC 


CCAGGGCCCC 


TCCGCCCTCC 


AGCAAGCCTC 


AGGGTCTTCA 


T T yl £T A 




CACATGAGGC 


CCTTCCTCCA 


GCTTCCCTGT 


CTGGGAGAGG 


GATGCCCCAC 


CCGACGTCCC 


J2 




CAGGGCCCAT 


CTGGGGACCA 


CCCCCTAGCA 


TCCTGCTGGC 


CCTGACAAGG 


GTGCCTCCCA 


neon 




CCCTCACCAG 


AGGCTCCTGC 


TCCTTCCAGG 


TGGCCGCCTC 


GGAACCCTTC 


CTCCTCTCCA 


j Z OftU 


25 


TCCCTTTCTT 


TTTTTGTTCT 


TGTTTGTTTT 


TTGAAATGGA 


GTCTCACCCT 


GTCGCCCGGG 






CTGAGGAGTG 


CAGTGGCGCA 


GTCTCGGCTC 


ACTGCATCCT 


CCACTTCTTG 


GGTTCAAGCA 


32760 




ATTCCCGTGC 


CTCAGACTCC 


CTAGTAGGTG 


GGATTACAGG 


- TGTGCACCAC 


CACACCTGGC 


32820 




'taattttgta 


TTTTTAGTAC 


AGATGGGGTT 


TCACCATGTT 


GGCCAGGCTG 


ATCTTGAACT 


32880 




TCCAACCTCA 


AGTGATCTGC 


CTGCCTCAGC 


TTCCCAAAGT 


TCTGGGATTA 


CAGGCGTGAG 


32940 


30 


CCACCACACC 


CGGCCTCTCC 


CCATCCCATT 


CTTATCTCTC 


AGAAAGAGGC 


CCAGGGAGCC 


33000 




' ACAGCCCCTC 


CTGCTCCAGG 


CCAAGGCACT 


GACCAAGCCT 


GTCCGGGAGC 


ACCCTGCTTC 


33060 




TTGCAGGCCC 


TGTCCCCGTG 


GGCCGCCTCC 


GTTGAAACTC 


CTGGGGGGTG 


GGGGATGGAG 


33120 




GACTCCTTGC 


CTTCCTCCGC 


TCCTCGGCTG 


CCTCCAGCCG 


CTTTTGCAGC 


TCCTCCAGGG 


33180 




AGGTGTCCTT 


CTTCTTGGGT 


GGGGAGGAGA 


GCATAGGGCT 


CTCTGGGGAC 


AGGTCAGAAG 


33240 
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GGGACTTGAG GATGACCTCG AAGCTCTGGC 
CCATGTCTGC AGGGCAAGAC CAGAGTAGAG 
GCTGAGCGGG ATGCTGCCAG CACACATCCA 
CTCCCGCAGG AATCCCAGGC TTCAGCCCCC 

5 TCTCCAGAGG CAGCCAGGAC GGGAGTTCAG 
AGGTGGGCAG GCACCGCAGG GAAGGGCAGG 
AGACGGTATT GGGGTGCGGC TGTGTGTAGA 
GCTCCTTCAT CTTCTCCTTG TAGGCTGTGG 
GGGCCTGCCC ACCTGGGCCC CCGTTTTCCC 

1Q GAGACACGGA GCTGCCCAGC ACGCTCTCTT 
TCTCCAGCTC TCTCGCTTCC AGACGTCGGC 
GTCTCTCTCG CCCTGTGCCT CTCCCCGCTT 
ATCTTCCACA CTGACTTGTT TCTCCATCCT 
ATTACCCAAA ACTCATGTGT TGAAATCGTA 

15 CATTCGGAGG GAATTAGGCC ATGAGGATAG 
TCAGAGAACT CCCTCACCTT CCATCATGTG 
TGTACCAGGA GGCGAGACCT CTCCAGGCAC 
TCTGGCCTCC AGAGCGATGG GAAATAAGTT 
TACCTGCCCA GACTGACAAA GTGGCTACCC 

20 TGTGTCTGAC TCTGTCACTG TCATTGTATC 
CTGTCTTTCT CCCTGAGTGC ATCTTTCTGT 
CTTACCTCCC TCTGTCCCGC TACTTCTCTC 
GCTCAAGCAG GCAAGATTTA CTCATGACGG 
AGGACTTTCT TGGGCTGTAA ACCTGGATGA 

25 TGATTGTGTG GTCACCTTGA GATGAAACCA 
TCTAGGGCTG GGCTTGGGCC ACTGATGCCA 
CAGCACCACA GCCAGCCCTC TTGCCCATTC 
CCTGCTGCCC AACCCAAACC AGAAGCAAGC 
GGAAGAGGAA GGCGCCTGCT GTCACAGTGA 

30 GCGAACCTCA TTCCTAAGCA GACACGCTGG 
TTTGGCTGGG TGCAGTGGCT CATTTAACTG 
GTGGAACACC TGAGGTCAGG AGTTTGAGAC 
CTACTAAAAA TATAAAAATT AGCCAGGCGT 
GGGAGGCTGA GGCAGGAGAA TCGCTTGAAC 



PCT/USOO/33065 



CTGAGGCCCG 




lbL l 1LALL 1 


it inn 


CTTCAGAGGC 


prrrrPA err* 


LATGGCCjTGG 


J.J .JoU 


ACCCCAGGGC 


TGGGCGAGAG 


GGGGTGGCTG 




AGGATGGGCC 


CCTTCCCCCT 


AGAACCTCCC 


33480 


AGAGACTGCC 


GGAGGCCGGG 


GGAAAAGGTG 


33540 


CGGCAGCCAG 


GCACTCACCC 


CCGTACTGGT 


33600 


AGCAGGAGCA 


GATGAGCGAC 


AGCACGGACA 


33660 


GCACAAGGCT 


GGGCTGAGCA 


AGCACCACTG 


33720 


TCCCCATGGC . 


TGCCTCTATC 


ATGTCTCTGT 


33780 


GTGTGTCTCC 


ACACCGCCGG 


CCCCTTCGTC 


33840 


ACTGTCTCCG 


TGGTGTGTCC 


CCTGCCTTCT 


33900 


TTCCTCTCTC 


TCGGCATTAA, 


TGTCTGTCTC 


33960 


TCTCCTGCCT 


GCTGTGGTCT 


GAATGTTTCC 


34020 


ACCCCAAGGT 


GCCGGTGTGC 


GGAGGTGAGG 


34080 


AGCCCTCCTA 


AGTGGCCCCA 


GAGTGGGGCT 


34140 


AGGACACAGC 


CAGAAGACGC 


CACCCGTCTA 


34200 


CGACTCTGCC 


GGCACCTTGA 


TCCTGGACTT 


34260 


CCTGTCGTCT 


ATAAACCACT 


CAGTCTCAGG 


34320 


CTGCCTGTCT 


GGGTCTCTGT 


TTACCTTCTG 


34380 


TTTCTGTGTC 


TCTGGGGGTA 


GCCCCTGACT 


34440 


GATTCCTTGT 


CACTGTGTGT 


CTTTCTGACT 


34500 


TCCCCTCCTC 


CTCCTTCCCA 


CTCCTCGCCA 


34560 


GACCAGCACA 


GATGCAAACC 


CTCTGTGGGC 


34620 


AGCCCTCAGA 


CCCTCCTTTT 


TCCTTCCCAA 


"3 A c Q n 
j 4 b 0 U 


GGCCCTCTCC 


AGGCACATGC 


TCTCTGTLTA 


1 A 1 A Ci 

34 /4U 


CCAAGGAGCA 


AGGGAGGGAA 


GCTGTCCGTT 


34300 


AGGTCAATCA 


AGTGCCCACC 


AGCCAGTGTC 


34860 


CGGGCTCCTG 


TGGCCCTGTG 


CCCTGTCAGG 


34920 


AAATAATTTA 


GCTCTTTTGG 


TCTATTCAGG 


34980 


CCCGGTTTCT 


CACTAGTGCT 


CGATAATCCT 


35040 


TAATCCCAGC 


ACTTTGGGAG. 


GCCAAGGCAG 


35100 


CAGCCTGACC 


AACATGGTGA 


AACCCGATCT 


35160 


GGTGGCAGGC 


ACCTGTAATC 


CTAGCTACTT 


35220 


CTGGGAGGCG 


GAGGTTGCAG 


TGAGCCGAGG 


•35280- 
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TCGCGCCATC GCACTCCAGC CTGGGTGACA 
AAAGAGAGAG AGGAAGAAAG GAAGGAGGGA 
AGGAAGACAG ACAAGGCAGA AGTAATCAAG 
CAGTGCAGAG AATGGTCTGT CCTGACTTAA 
5 GAGCAGCACA GAGCCCAAGA GGGTGTAAGG 
GCCCGTGCGA GAGCCTTTGG GGGAAGGAAT 
TACCTGTCTT GGTGATTGTC TTCAGGGAGG 
CACAGGGGAC TTTGGTCTGA CTTGTCAAGT 
. GCACTGTTGC CCAGGCTGCA GTGCAGTGGT 
10 CTGGGTTCAC ACCATTCTCC TGCCTCAGCC 
ACGACACCCA GCTAATTTTT TGTATTTTTA 
GATAGTCTCG ATCTCCTGAC CTCGTGATCC 
TACAGGTGTG AGCCACTGTG CCTGGCCTAC 
TTGCCCATGC TGGAGTGTAG GGTGCAGCTG 
15 CTTGACGGAT CCTGCCATCT TAGCAGCTGG 
TCTTTTTTTT TTTATCTCTG CTCACTGCAA 
GCCTCAGCCT CCCAAGTAGC AGGGATTACA 
GTATTTTTAG TAGAGACAGG GTTTCACTAT 
CCGCCCGGCC TACACACCAG CTTAAAAAAA 
■20 GCCTGTAATC CCAGCACTTT GGGAGGCTGA 
AAGACCAACC TGGCCAACAT GGCGAAACCC 
GGCGTGGTGG CGGGCTCCTC TAATTCCAGC 
TGAACCCGGG AGGTGGAGGT TGAAGTGAGC 
AGCAAGACTC CCGTCTCAAA AAAAAAAAAA 
25 GTGGCTCACG CCTGTAATCC CAGAACTTTG 
GGAGTTCGAG ACCAGCCTGA CCAACATGAT 
ATTAGCCAGG CATGGTGGCG GGCACGTGTA 
GAATCGCTTG AACCGAGGAG GCAGAGGTTG 
AGCCTGGGTG ACAGAGTGAG ACTCTGTCTC 
30 ATACATGTAT ATATATAATA T AT AT AT AC G 
ATACGTATAT ATACACGTGT ATATATAATA 
TATACGTATA TATACACGTG TATATATTAA 
TATTAATATA TATACGTATA TATGTGTGTG 
TATATACATA TATATATACA GAGAGAGAGA 



GTGTGAGACT 


LLb 1 L 1 LnnA 


Af Af" 1 A A AT7\ A 


jj J4 U 


GGGAGGGAGG 


AAAAGAAGAA 


AGGAAAGGAA 


J:>4U U 


CCTTTCATGG 


TGAGLTGGb 1 


L 1 I L rGGTGA 


"1 C /I C fi 


ATTTCCTGGT 


GACCTACACT 


T i TCTGGAC A 




AGGAGCAGAA 


AGGAATCCCA 


GGGTGGGCAG 


35580 


GAGACTTTGA 


GCCGGGAAGC 


GAGGCAAAGC 


35640 


GAGATGGAGG 


GGGACCAGGT 


GGGGGAGCCT 


35700 


TTTCTTTTTT 


TCTTTTTGAG 


ATGGAGTCTT 


35760 


GCGATCTCGG 


CTCACCGCAA 


GCTCCGCCTC 


3 5820 


TCCCGAGTAG 


CTGGGACCAC 


AGGCACCGCC 


35880 


GTAGAGACGG 


GGTTTCACTA 


TATTAGCCAG 


35940 


GCCCGCCTCG 


ACCTCCCAAA 


GTGCTGGGAT 


36000 


TTTATTTTTT 


AGAAACAGGA 


CTGTGCTCTG 


36060 


TGCGGTTCAC 


TGCAGCCTTG 


AACTTCTGGG 


36120 


GACTACAGGT 


GCATGCCAGC 


ACACCAGTTT 


36180 


TTCCGCCTCC 


TGGGTTCTAG 


CGATTCTCCT 


36240 


CGCACATGCC 


ACCACACCCG 


GCTAATTTTT 


36300 


GTTGGTCAGG 


CTGGTCTTGA 


GCCACCGCGC 


36360 


AGAAAAAAAT 


AGCTGGGCGT 


GGTGGCTCAT 


36420 


GGCAGGCAGA 


TCACCTGAGG 


TCAGGAGTTC 


36480 


TGTCTCTACT 


ACAAATATAA 


AAATCAGCCA 


36540 


TACTTGGGAG 


.GCTGAGGCAG 


GAGAATCACT 


36600 


CAAGATCGAG 


CTACTGCACT 


CCAGCCTGGG 


36660 


AAATTTGTAG 


TGGTATGGAG 


GCCGGGLATG 


J 0 / £. U 


AGGGGCCAAG 


GCGGGCAGA1 


LA aGAGG 1 LA 


jo / oil 


GAAACCCTGT 


CTC TACT AAA 


AATAACAAAA 


36840 


GTCCCAGCTA 


CTCGGGAGAC 


TGAGACGGGA 


36900 


CAGTGAGCTG 


AGATCACGCC 


ACTGCACTCC 


36960 


AAAAACAAAC 


ACAAACAAAC 


ATATATATAT 


37020 


TATATATACA 


CGTGTATATA 


TATAATATAT 


37080 


TATATACGTA 


TATATGTATA 


TATTAATATA 


3-7140 


TATATATACG 


TATATATACA 


CGTGTGTATA. 


37200 


TGTGTATATA 


TATATGTATA 


TATATATATA 


37260 


GAGTAGTGAT 


AGGTCTTGCT 


GTCTTGTCCA 


37320 
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GGCTGATCTT GAACTCCCGG 


CCTCAAGAGA CCCTCCCACC 


tp ap revere 


a aacpapt ap; 


J ' JOU 




GATTATAGGT GTAAGCCACA 


GTACCTAGCC 


TATTAAAAAT 


L AA lul 1 AAA 


paap appatp 


17 AAf) 




TGATGAGGGA GTTAGAGGGT 


GTGCCAGCCA 


TGTGTTCCAC 


AGL. Abb Aoo 1 


pappap apat 

bAbbAbAb A 1 


j / DUU 




TGGGGACATT TAGAGGAGCT 


GAAGAGGTGG 


CCAACCCTGT 


bb 1 b Abb Abb 


a P p p_p rr a p p 

Ab bbbbb Abb 


J / joU 


5 


GAGAGAGCAA GAGGGAGTTT 


GGGCTGGGGC 


A^AACGTACC 


iGGGILL 1GA 


p app at" a ara 
b Ab (a A 1 ArtOn 


J / OtU 




AGGTAGGGAC TTGGCCCCTC 


CAGGCCTGAC 


•tptp,ppap.pa 


ACCAGCTCCC 


TATCAGCAGA 


neon 




CTCCAGGCCC CTACCCTTCA 


GCTCATCCTT 


CCTTATCACA 


CATC C AAAAC 


TCTGAATGTG 


J / / 4U 




GCCGGGCGCA GTGGCTCACG 


CCTGTAATCC 


CAGAACTTTG 


GGAGGCTGAG 


/''^n rr<^ rn 

GC AGG AGG AT 


6 1 oUU 




CGCTTGAGAA CAAGAGTTTG 


AGACCAGCCT 


AGGCAACATG 


GTGAAACLLL 


Alb ILTAL 1 A 


J / ODU 


10 


AAAATATAAA AATTAGCTGG GTGTGGTGGC 


ACATGCCTGT 


TGCCCCAGCT 


ACTCAGGAGG 


j / y 




CTGAGGCAGG AGAATCACTT 


GAGCCTGGAA 


GGCGGAAGTT 


GTAGTGAGCA 


?V P 7V rn*71PT>PP 

GAGATTGTGC 


j / y oy 




CACTGCGTTC CAGCCTGGGC 


AACACAGCGA 


GACTCTGTCT 


CAAAAAACAA 


AAACT GG AAT 






GTGTTTACCA TAAAGGCCAG 


AAAATGTGAT 


TAACAGCTGC 


TCAAAGCCCC 


TGTCTbbbLT 


Jo XUU 




AAGCCTGAAA TTTTCACCGA 


AAAAAAGATC 


TGTAGGCTCA 


m TV TV A** 71 7% 7V 

T AC AG AGG AA 


GG AC AAAC Ab 




15 


CAGGGAGGCT CTCTTCCAGT 


TTGCTTCACC 


TCAGCAAGCA 


GACGGCTGGC 


AbbAATTTGG 






GGGCAGGTGT GAGCACCTGC- ATCATCAGGA AAGAAGGGGC 


ACGGTGGGGA 


CGCAGGTCAG 






ACCTCTCACA GGTCTTGGCT CTGCCCAGGA GACACGTGTC 


CAACTGAGAG 


GTGAGGAACT 


38340 




GGGTTCTGCA GCTGCAGACA 


CAGGTGCGGC 


TCAGCATCTG 


ATGGCCACGG 


AGACCCCCTG 


38400 




GCTTGGCTTC TCCCAGCTGG 


TGGCCCATGA 


GGAGCTTCTA 


TCCCAAGAGA 


CTGTCCCTCA 


Jo4oU 


20 


AGGAGCAAGT GGGACCAGGT ACCCACAGGA CGGAGCCTGG 


GAGTGAGGCC 


TGCCCTGTGG 


noon 




TCT.GGCTACA GGGAGGAAGG 


GCAGATTGGA 


GGGGGCAGGA 


CAGCAGGTCA 


GGAATTGGCC 


■JOCQfi 

JODOU 




AACTCTGGAG AGAGCAAGCA 


AGGGGAAGTC 


TGCGCACAGG 


GC AGGGC TGG 


TLAGGGGCGA 


job4U 




GGCAGGGCAT TGGACCAGTA TTTTCAGAGC TGGTGAGGCT 


TAAAGAGLAT 


GIU IAL IGLL 


JO / uu 




TCTTATTACA GAGAGAGGAT 


GCCGAGGCCC 


AGACCCATCC 


AbbbbAbb i b 


TPP AP AP AP A 


JO / «U 


25 


CAGCTGGTGC CAGGGAAGCC 


CCTCCCAGAG 


CCTCAAGGCA 


TTGCTCCCTC 


TCTCTCTCTC 


13R? ft 




TTTTTGTTTT TTTGGAGACG 


GAGTCTCACT 


CTGTCTCCCA 


GGCTGGAGTG 


CAGTGGTACA 


38880 




ATCTCGGCTC ACGGCAAGCT 


CCGCCTCCCG 


GATTCACGCC 


ATTCTCCTGC 


CTCAGCCTCC 


38940 




CGAATAGCTG GGACTACAGG 


CGCCCGCCAC 


CACGCCCAGC 


TAATTTTTTG 


TATTTTTAGT 


39000 




AGAGACGGGG TTTCACTGTG 


TTAGCCAGGA 


TGGTCTCGAT 


CTCCTGACCT 


TGTGATCCGC 


39060 


30 


CCGTCTCAGC. CTCCCAAAGT 


GCTGGGATTA 


CAGGTGTGAG 


CCACCGCGCC 


TGGACTTTTT 


39120 




TTTTTTTTTA AGACGGGGTC 


TCACTCTGTC 


ACCCAGGCTG 


GAGTGCAGTG 


GCGCGATGTC 


39180 




GGCTCACTGC AACCTCTGCC 


TCCCCAGTTC 


AAGTGATTCT 


CCTGCCTCAG 


CCTCCCAAGT 


39240 




AGCTAGAATT ACAGGCACAT 


GCCACCATGC 


CCAGCTAATT 


TTCTGTATTT TTAGTAGAGA 


39300 




TGAGGTTTCA CCATGTTGGC 


CAGGCTGGTC 


TTGAACTCCT 


GACCTCCGGT 


GATCTGCCCA 


39360 
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CCTCAGCCTC 


CCAAAGTGCT 


CXZC A T P, A P A d 


GCGTGAGCCC 


CCGCGCCTGG 


CCCCCCGCAG 


39420 




TGCTGGGATT 


ACAGGCGTGA 


vj ^ v» V- ov» vj 


CCGGCCCCTC 


CCTCTCTTTG 


ACTCCCTTCT. 


39480 




TTCTCACCGC 


CCCCTCCCCA 


CCATCCTTCC 


CCTTCACTGA 


CTTCAGGGAG 


TTAAAAACAA 


39540 




TTCTCGCAGT 


GAGCTGGGCT 


TGTTTTGTCT 


CCCTGCTTCT 


CTTTGTACTA 


AACATTAGAT 


3 96 0.0 


5 


ACCGAGGAAA TGCGGATTGG 


CCTTTGGATG 


ATTCATGAGC 


AGGAGTCAGA 


AAAAGGCACC 


39660 




. AGGTTGGCCT 


CAAGCAGCAG 


GGTATAGTAG 


TGCCCGCTCC 


CAGGGTCACA 




39720 




ACCCCTCCCG 


CCGTCCAGGT 


GG ATGGTGCC 


CACTCCCAGG 


GTCACACCTC 


&rczrrr appp 


3 9780 




CTCCCGCCGT 


CCAGGTGGAT 


GGTGCCCACT 


CCCAGGGTCA CACCTCACGC 


ILL 


39840 




CGTCGCCCAG 


GTGGATGGTG 


CCCACTCCCA 


GGGTCACACC 


TCACGCCCGC - 


CCCTCCCACC 


39900 


10 


CACCCGGGTG GATGGTGCCC 


GCTCCCAGGG 


TCACACCTGA 


CGCCCACCCG 




3 9960 




GCCCGCTCCC AGGGTCACAC 


CTCACGCCCA CCCCTCCCGC 


CCGCCCGGGT 




40020 




CGCTCCCAGG GTCACACCTC 


ACGCCCACCC 


CTCCCGCCGT' CCAGGTGGAT 


p,p,tp,pppapt 


40080 




CCCAGGGTCA CACCTCACGC 


CCACCCCTCC 


CGCCGCCCAG 


GTGGATGGTG 


ppp arTrrra 


40140 




GGGTCACACC 


TCACACCCAC 


CCCTCCCGCC 


CACCCGGGTG 


GATGCCCTTA 


TP AP,PTPTPP 


40200 


15 


TTCTCCTTCT 


CTTTCGTCTT 


CTTCGTCTTC 


CTCCTCTTCT 


TTCTTCTTTT 


TTTTTTTTTT 


40260 




TAGAAAGAGT 


TTCTACTCTT 


GCTGCCCAGG 


CTGGAGTGCA 


ATGGCACAAT 


PTP AP,PTPAP 


40320 




TGCAACCTCC 


CTCTCCCCGG 


GTCAAGCAAT 


TATCCTGCCT 


CAGTCTCCCA 


P aT^PPPTflPP 






ATCACAGGAG 


TGTGTCACCA 


CACCTGGCTA 


ATTTTGTACT 


TTTAGCAGAG 


APPPPPP A'PT 


A fi d A Ci 

*4 U ** *i v 




TCACCATGTT 


GGCCAGGCTA 


GTCTCGAACT 


CTTGACCTCA 


GTTTATCCAC 




*i \J Z3 U U 


20 


CTCTCAAAGT 


GCTGGGATTA 


CAGGCATGAG 


CCACCCTATC 


TGCCTCACTT 


PTAP AP APPA 






GGAATGAAGG 


CTCAGAGAGG 


GCAAGCATTC 


CACCCAGCAT 


CACACAGAGT 








AGCCCAGTCA 


TGAGCCTGGG 


CCTGACTGCA 


GGCTCCTGTT 


GGGAGCTCGC 


P,P, A P,P,T P, CZCZCZ 


40680 




GATCTGTCGA ■ GAACTGAGAG 


GCCAGGGGAC 


CACAGTGGCC 


TCTGACCCCT 


GGAGGGCCCT 


40740 




GGAGGCTGCT 


GCCGGCTCCC 


CCCGGGGGCA 


GATGGAGGTC 


ACTGTCACCC 


AGGCTGCTTC 


40800 


25 


TCATGGTGCC 


AGGAGCACAG 


CATGGCAGGA 


GCCACCAGCC 


GATTTGCCTT 


TCCCTGGGCA 


40860 




GGAAACTCAG 


AAATGTGGCT 


ACCACAGTCA 


GGCTGCTTGA 


CGTGCGGTGA 


GCACTCATCT 


A A 0*9 n 
4U 3 £, U 




CTTAGCAGGC 


AAGCGGCCAA 


GCACCTTTCC 


TGAAATATTG 


AGGCCTCAGA 


ACAAGCCCCA 


40980 




GGAGAGGTGC 


CAGCACCGTC 


ATCTCTACCC 


AGATAAGGAG 


ACCCAGGTCC 


TGAGAGGTTA 


41040 




GGCAGCTCGG 


ACAACACCAC 


ACAGCTGGAG 


GAGGTCAGAC 


TCTGGGTTGC 


AGAAGGAGAA 


41100 


30 


TGTGAGCAGA 


GGCCACAAAA 


GAGCGAGGAG 


CCAGTGCCCA 


GATGCCGAGA 


TGCCCTCGCC 


41160 




CTCCCAGCTC 


AGCCCCAGGA 


ACCGAGCCCA TGGGGAGGGA 


CCGTCAGGGA AAGGCTGTCA 


41220 




GGAAGGGCAG 


GAGGCGGCCC 


TGGAGAGGAC 


GGCGCTGCCC 


TCAGGGGCAG 


GAGGGGAGTC 


41280 




CCCTCCGCTG 


AGAGCCCCCC 


CACCCCCAGT 


ATCCCCGGGG 


GTGTCCAGGA 


GGAGGCGGAG 


41340 




GGAGGAAGCG 


CAGATGGACA 


GGACTCCCAG 


ATAGGGTGGG 


GAGGTGTGGC 


CGGTGACACA 


41400 
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CACGGTCCCC TCCTGGCAGG TGCTGAAGTC ACCTGGAGCC TCCAAGCCCG TGGGGCCTGA 41460 

GGGGCGGGGT CAGGTCGGGC ACGCGTGGGT GGGCGGAGTT CTGCGCCCCG GGCCAAGGCG 41520 

CCCGAGTTGA ACCAGTCAGC TCGGGAGAGG GACCGCGGCG ACCTGTCCCG GGGGCGTAAG 41580 

AAAAGGTGGG AGGGAGTGCG GCTCGTGAAC GGGGGCGGCG ATGGGAAGGA GGTGCGGCCC 41640 

5 TTCGTCCTGT CCTCCCAAAC GTCGAGTGAA AAACGAAGCG GGTTCTGCGG CCTCGCGGCG 41700 

GAGCAGAGCG TTTCGGGAAG GGCGGGCCCA GCGTCCTCGC GCCCGAGGTC GCCCGGCAGC 41760 

TCCCCTGCGT CCAGAATCCG CCCCCCGCCC GGGCCTGCGC CCGCCCCTCC GCCTGAGCTC 41820 

CGCGCGGGAC GGGCCGGGAG GCCGGGGTGG GCGCTACCTT CGAAGGCGGT GGGTCCGCCC 41880 

CGCGGGAGGT GGAGGGGCGG GAGGGGCGGA GCCCTCTGGT CTCCGGAGGG TTTGGGGATC 41940 

10 GCAGTOGCCC CTCCCCCATC CAGACCCCGC GGCGCAAAGG GCAGTGGCTT TTCTGGCCAG 42000 

AGCAGGTGGC GCGGGCGTCG CAAAGGGTGG TCCCCGAGGC CGCAGCGGTG TGGGGGGAGG 42060 

GCGCGGTCCC CCTCACTCCG GGCTCCGCCG TGTCTGGCCC GCCCCCCTCC TTCAGCGCCC 42120 

CCTCCAGCCC CTGTGCTGCA CTGGCGCGGG GAGCGCCGGG TTCCCGGCTG GGGCTTTGGC 42180 

AGAGGGTCCC ACCCTCTCCC CGCCTCCCCA CGAAGGCTCT GGCGGACCCA GATCTCGGGT 42240 

15 CGCCGGACGC CCCAGGGAGC CCGCCCGCAC ATCGCGAGCG CGCCCACCCG ■ GTCGCGAGCC 42300 

CACGCCCGGG TCTGGGAGCC ACCCTGCGGC AGTCGCGCCC TGCGTGGCAC GCTGCTCCCC 42360 

CAGGGGCGAG GCGCCCCCGC CCGACGTCCC GGTCCCGAGC GCTCCCCGGC GCGGCGCCTC 42420 

GCAGCCCAGC GCCCCACCAG CCCCGCCGGC GCCGCAGACC CCAGCCTCGG GCGGGTCGGG 42480 

CCCAGGCTTG CAACGCGCAG GGTAGGAGAA GGGAAATTGG CGTCCGCTGC CGGCCGCTGC 42540 

20 CCCAGGCGAG GCCAGACGAG GCCTCTGCTC AGATCCCGCC GCCCCACAAA GCCCGTGGCC 42600 

CCGGAGCCTA CCGGAAATGG TGCTGGCCAT GGTGCTGGCG GCGGTTGGGC CTGCGGAGGC 42 660 

TGGAGAGGCG CAAGTGGCGG CCGGAGCTGC AGACGGCTGG TGCTGCAGTG CCGGGGAGGG 42720 

GAGGGGAGAG GAGTGGAGGG AGCGAGGGCG GGCGGGAGGC GGGCGCGGCG GGAGAGAGAG 4278 0 

AGGGAGGGAG ACAGAGGGAG .AGAGAGAGAG GGTTGGGGGA AGGAGCGGGG GGAGGAGGGA '42840 

25 GGGAGGGTTG GGGGAAGGAG AGAGAGAGAG AGAGAGACTG CGGGGGCGGG' GGAAGGAGGG 42900 

AGGGAGGAAG GGAGGGAGGA AGAGAGAGAG GAGCAAGCGC CTGGCTGCGG AAGGGGCCGC 42960 

GGCTCTCAGG GGGAGAGGGC GGAGGAGGGG GGCTACCCGA ACTGCAACAA GACCCCCCAC 43 020 

CCTCCAACCG CTCACAGCGG GACAGCTGCT TCTCCAACTT GGCTTTGTGA GGCCTGAGAG 43080 

TGGGGTGGGG GTGGAGATGA GCCCCCATTC CCCAGGGCAG GCGGGGCAGG GGCAATGCCG 43140 

30 GAGGAGCAGG TCCCACCCAT GGGGTGGGGC CGCAGAGCTC TTCGCCGCCA AGGCCGCTGT 43200 

AGGCTGGGCT GGCGCCAACA GGGTCCAGGT CTGTGCCTGC CATCGGAGAG GATGCGAGAG 43260 

CCACAGGGGT GGGCGCTGGC CTGGAGGCCT CCAAGGGGCA TCTCCTGTGA GCCCAGGGGA 43320 

TGGGCAGGAT CTGAGCGGAG AAGAGTGAAA GTGGAGGAGT GAGGCCAGAA CAAAGGCTTT 43380 

GCCGTGAAAG AGGTGGTTTC CCGCCTGGGC TCAGACCTTC ACTCACTGTG TGGCCCAGGC 43440 
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CAAGGGCAAG CGTCTGACCT CGCTGGGCCT 
TGCCCCCAGA CGATGGAGAG GAGGGGTGCC 
AACCTATCCT GCCTCCCCCT CTCCTCATAA 
. GAGCGTCTTT GCCATTGCAG GGATGAAGCC 
5 ACAAGCAACT AAGTCATAGG GCCAGGAGCA 
GCCATGAGGC TGGCTCAGCA ACACAAAAGC 
TGGGAGCACC CAAGGCCAGT CACATCCTAG 
GCAAGCCGGC ATCATCCCCG GGACTGTGAG 
ATTGTGTGGC CCAGAGGTAC CCTTAAGACT 
10 CACAGGAGCC CTTTAAAAGC AGTTTCCTTT 
CAAAGAAGGT TTTCTGTTGT TGAGATGAGG 
CCCAGCGAAC AGCCAGCAAG ACAACAAGAC 
AAGAAGAGAA TGGGCTTGGA GGCAGGTTTG 
CAACTGCCCA CTTGGTTTCA GCCTTGGGTT 
15 TGTTTTAAGC TGTTGGTTTT GTGGTGATTT 
ACAGAGGTGT TGGCCCCTGG GCCATTCCTA 
GCAGAGAGGC ACACAGAGCT GCCCACCACA 
GTAGCAGTTG GGGATGGTCA GCATCCTTCA 
AGGTGCTAGG GATCAAGAGA AGTAGCCCTC 
20 TGGGGTAGGA CCCACCCAGT CAGGCTCAGA 
' CTGGCCAGGC CCTCTCTGAG CCTCTGTGTC 
GAAGCCCGTG , GC ATAGTGCC AGGAACACAG 
ACACAACAGG GGTACTGACT GTCAGCACCT 
TCAGCCATAC TGACTGTCAG CGTGCTAGCA 
25 CTAGCAGGCA CATGCCATAG GTGTACTGAC 
GAGTAATGAC AGCATGCTGG CAGGCACACA 
GTGCCAGGTC TTGGAGCAGA TTTTGACTTC 
GCATTTCAAA GATGTTTTTC ACATGCACAT 
GTGCAGTGGC TCACAACTGT AATCCCAACA 
30 GAGGTCAGGA GTTTGAGACC AGCCTGGCCA 
ACAAAAATTA GCCAGGTGTG GTGGCAGGTG 
GCAGGAGAAT CACTTGAATC CAGGAGGCAG 
CACTCCAGCC TGGGCAACAA TATCAAGACT 
AGTGTGACTT TTATCAGACC TCTGCATTCT 



PCT/US00/33U65 



TTGTTTCTCA 


bbbb 1 AAbA i 


bAAAL HA X bA 




AGGGTTGTGC 


bb AL 1 I Ab 1 b 


Ab I bbbbbbb 


4j jbu 


CTCCCAAAGG 


GAAAGC L Tbb 


1 Abbb AAACG 




ACCGAGGCAG 


GGAGAAAAGT 


bb TTTGC C C T 


43 680 


AAACCCTGAA 


AACCTCAGGA 


GACTTGCAGA 


4374.0 


CAGGGGCAAG 


CCTCAGCTCT 


AGCAGTGCGG 


43800 


GGTGGCCTGG 


AGAGTCCTGA 


CCCCTGACGT 


43860 


TCTGGTGGGG 


GTGATGCCCA 


GGAATGTGAC 


43920 


GGAGGATCAC 


CAGGCGGGCC 


CTGACCTCAT 


43980 


GCCTGGTTGA 


AGAAATCGGA 


GGGATCAAAC 


A A- f\ A f\ 

44040 


GGGCCACGTG 


GCAAGGATCT 


GAGAACTGCT 


4.4100 


CTTAACTGCA 


AGGAAGTGAG 


TTCTGCCAAC 


44160. 


ACCCCAGGGC 


CTGCACACAA 


GAACTGAGCC 


44220 


ACTAAGAATT 


AGGAGGTAAT 


GAATGAGAGT 


44280 


GCTATGAAGC 


CATATCAAAC 


TAATATACAC 


44340 


GGAAGCCAGC 


TCTGCGAAGG 


AGGAAGAAGG 


44400 


GCAGCTGTGT 


CC.TCCCTGTT 


GGCCACCACA 


44460 


GGCAGACTCC 


AGCCCCGGGT 


GCTGGAGCTC 


44520 


TCTGGGACCT 


CCAGAGTCTT 


CTCATGTGGG 


44580 


GCACCGCAAT 


GCCTCACACT 


CATTGTGACT 


44640 


CTCATCTGGA 


GCACAGGGAC 


CAGGTGTGTG 


44700 


TAGATGTGCA 


CAGTGTGCAC 


TAGCAGGAAC 


44760 


AGGCAGGCAC 


ACGCAATGGG 


GTACTGACTG 


44820 


GGCATACACA 


ACAGCTGTAC 


Tb AL Ab L AL A 


A A Q Q f\ 


TCTCAGTGCA CTGGCAGGCA 


LALbLAAI Ab 


A A OA n 


ATAGCTGTAC 


TGACTGTTTG 


CCCCAATATA 


45000 


TCACCAAGAT 


CAAATGCAGA 


AAGTGCACGA 


45060 


TAGTGCTAGT 


TAAAAAAATG 


TTTTGACTGG 


45120 


CTTTGGGGGG 


CCGAGGTGGG 


CAGATCACCT 


45180 


ACATGGTGAA- ACCCCATCTA 


CCCTAAAAAT 


45240 


CCTGTAATCT 


CAGCTACTTT 


GGAGGCTGAA 


45300 


AGGTTGCAGT 


GAGCCGAGAT 


CCCACCACTG 


45360 


CCACCTCAAA 


AAAAAAAATG 


TTTTTCATAA 


45420 


TGAAATTAAC 


TCTGGCTTGG 


CTGGGCGTGG 


45480 
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TGGCCCACAC 


CTGTAATCTT 


ASP APTTTGG 


GAGGPTGAGG 


TGGGCAGATC 


ACGAGGTCAG 


45540 




GAGTTCAAGA CCAGCCTGAC 


PA APATGATG 


AAAP P P P ATP 


TCTACTAAAA 


ATACAaaaAT 


45600 




tagppgggpg 


TGGTGGCATG 


P'APP'TPTA AT 
LAUL 1 o 1 AA J. 


PPPAGPTAPT 

LLU rtLjV. A-rtV. L 


CAGGAGGCTG 


AGGCAGGAGA 


45660 




ATPGPTTGAA 


PPCAGGAGGT 


bbAbb 1 1 vjV-A 


pppappppag 


ATPGPAPPAP 


TPTATTPPAG 


45720 


5 




AGAGPAAGAC 


TL iblb 1LAA 


aaaaaaaaaa 


GA AAGAA AGA 


A ATT A A PTPT 


45780 1 




bbb 1 bb 1 AbA 


appagpppta 


TATCTLAbbA 


p*p* a p* a P'TP* AP"* 
bbAL AL Jl b AVj 


TP ATTP a ap A 


PAPATPTPTP 
AV, A XL lull.' 


*± J O *4 U 




a apf" apt'TPP 

AAbL AC C 1 


TPTATGPTGG 


AGCTGTbbbT 


a /T , T>f~*~?\ p , p* A a 
Abb 1 LAbb AA 


T"T7iP! A (~LCl~h Afl 


APPPP APPPP 


d j u \j 




1 Ab AbvjAJj i I 


pptgappapc 


CCAGGCCAGC 


ALbL ILL I ai 


APPAPPTPftP 


A Af^PAPP APA 


4 J J DU 




1 bAL 1 b AbAL 


TTP A P.P TP A G 


rrV'f'"* A P* 7\ C f~* A P* 

TC C AC Abb AL 


IPPPTTTTPT 
AbbL 1 11 1H 


pp:pp APTPPT 


PTPAGGAPAT 




10 


b AbA 1 b 1 b 1 b 


PPTPPAA A AH 


GTAAACTbLT 


r , P , P ,r rr , P"T , P"' a p 1 
bbb 1 bb 1 tjALi 


P A PPPTPTP4P 


PPAATPTPPT 


*± U v o u 






TPPPTPAPTT 


TCTCACLLAb 


AAAb 1 b 1 bbA 


P A ATOAPAPJ"* 


APTTATPTPP 


4014^ 




p 1 <t» c p* c c f P 1 p* 

b 1 bbbbbbbb 




CTGTAATCbb 


A iT 1 A P 1 ' P' P'VP'P"' 

Abb Ab 111 bo 




PPPPTPPATP 


1 □ Z w U 




ALLTbAbb 1 b 


AbbA^ji ltnn 


GACCTGCCTG 


bb b AAL Ab bb 


TP AAA PTPP A 


TPTPT APT A A 
1L1U1 ftt 1 AA 


/i fi*? fin 




AAA T» A rp A A A A 

AAATATAAAA 


a A. TT* AY" PP'PT"' 

AA1 lAbCCoo 


GCTTAGTGGT 


i^P* A P 1 APPTPT 1 

bbAbAbb 1 b 1 


aaTPPP A PPT 
AAi v.H-AVjk- 1 


APTTP APAPP 


4uj^U 


15 


CTGAGGCAGG 


AGAATCACTT 


GAACCCAGGA 


PPiTipPAPPTT 1 

GGTbbAbb 1 1 


CC APTPAPPP 
bbAo 1 VjAVaCU 


A APATTPTP1P 
AAu A 1 1 \j 1 Kj^. 


40 JOU 




CACTGCACTC 


CAGCCTGGGC 


aAAAAGCCAA 


AACTCTGTCT 


CAAAGAAAAA 


APA ATP ATPP 


*± D *4 ft U 




CAGAAGGTGA 


AGTCTATGTT 


AGTCCCAGTT 


CCCAGGTCGT 


ACATGGCGGC 


A PP APA A APA 
AkjoAoAAAoA 






GAGAGAGAAG 


GGGAAACTGC 


CACTTTTAAA 


CCATCGGGTC 


TCCTGAGCAC 


TP* A. P* TP" TP SP 
1 bAb 1 b 1 b Ab 


f4 0 D 0 U 




AACAGCCTGG 


AGGAAACTGA 


CCGCATGATC 


CAACCACCTC 


CCTCCAGGTC 


Lblbbb ILbA 


ft 0 O ^ VJ 


20 


CACGTGGGGA 


TTACAATTCG 


AGGTGAGACT 


TGGGTGGAGA 


CACAGAGCCG 


a appiTaTPa 

nnv-LA 1 A 1 b A 






GCATGTATGG 


GGGGCACTGA 


AACTTGTGCT 


TGGTGCCCAT 


TCATTCAACG 


APTPTPTPTP." 
Au lb ib Ivj lb 


46740 




GCTGGTCTCC 


TCATCTTCAA 


CTCCCTGCCG 


AGTCTCAGAT 


AGGCAGCCTG 


P APTTPPTTP 
Unb 1 XLLl IV. 


** O □ U 




ACCACAACAG 


GCACATGGGG 


CTGGGTGCCA 


GTGAGTGCTG 


GGGCTTCTCC 


G AGP AP T ATP 


46860 




TCACACCCAG 


GAGCGTGGGC 


ACGCATGGCA 


TTCGCATGTG CCGTCAGTGG 


A P ATT AAA P A 


46920 


25 


CAGCCATGAA GAAGCCACGA AGAAGTGCTG CCTGCCGGCC 


GTGCGCGGTC 


ACGCAGCGCC 


46980 




AACTCCCTCC 


TGGGGCCTTC 


TGGGGCCTTC 


TGGGGCATGG 


GAGCTGGGGC 


bbbb TbAbAb 


A 1 O A Pi 

4 /U4U 




AAACATCCGT 


GACGCTGGGC 


TGACCCCACA 


GAACGGTGCG 


GGCCTCGCTC 


TTGGAGTCAG 


47100 




CCCTGCTGCC 


AGCCAGTGCC 


GGGTGCTGGG 


GACTCAGGGA 


GGCCCGCCGG 


GACCACTGCG 


47160 




GGACAGTGAG 


CCGAGCAGAA 


GCTGGAACGC 


AGGAGAGGAA 


GGAGAGGGGG 


CGGTCAGGGC 


47220 


30 


TCTCAGGAGC 


CGGGTCCTGG 


GCAAGGCGCA 


GCCGTTTTCA 


AATTTTCAGG 


AAAGCGGTCG 


47280 




GCTCACACTC 


GAGCAGTAAA 


AAGATGCCTC 


T.GGGGAGGAG 


GCCCGTGCAG 


CTCTCCGGGC 


47340 




AATGGTGGTG 


GCTCGGCGTA 


GAGAGGCGGT 


AGTGGAACGC 


AGACCCTGGT 


GGGGGAATGA 


47400 




CATCAAGGGA GGAGACGGGC 


GGGACCCCAG 


ATTTCTGCCT 


GTGGGCGATG 


GAAGTGAGGT 


47460 




TCACTGGCCA 


GCGGAGCCGG 


ACACAGAACG 


CGCAAAACGC 


CGTGTAGGCC 


TGGAGGAGCC 


47520 
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GAAGAGCAGG 


CGGACCCCCT 


CCGCGGGGGA 


ACAGTTTCCG 


CCGGGAGCAC 


AAAGCAACGG 


47 580 




ACCGGAAGTG 


GGGGGCGGAA 


GTGCAGTGGG 


CTCAGCGCCG 


ACTGCGCGCC 


TCTGCCCGCG 


47640 




AAAACTCTGA 


GCTGGCTGAC 


AGCTGGGGAC 


GGGTGGCGGC 


CCTCGACTGG 


AGTCGGTTGA 


47700 




GTTCCTGAGG 


GACCCCGGTT 


CTGGAAGGTT 


CGCCGCGGAG 


ACAAGTGAGC 


AGTGAGTCGC 


47760 


5 


AGTGACCCTA 


CAAGTGGTTC 


TTTTACCCGA 


GCGGCTCGTA 


GGCGCGTTGC 


GGTTTTTCGA 


47820 




AACTACAGCT 


CCCGGCAGGC 


CCCAAGCCGC 


CCTCGGGGCC 


GCGGGTCGGC 


GGATTGGCCG 


47880 




CGCTGCATTT 


TGGGACCTGT 


AGTTTCCTGC" 


GCTCGTGGCG 


CTGGCGCCGC 


GGCGTTGGCT 


47940 




GAGCCCTTGA 


CCGGGGCTGG 


AGGGAAGGGC 


CGACATTCAG 


TGTGTCCGCG 


TCTGTTCTGT" 


48000 




TAGTCCCAGT 


TCCCGGGCGG 


GATTGAGGCT 


TAGAGAAGTT 


GAGTGATTTG 


CTGAGGGCTG 


48060 


10 


CACGGGTTGG 


CATCCCGGCA 


TGCTCTTTGG 


CTACTTTGGC 


TGCATCTGGT 


TGCCCACCCG 


48120 




GGCGGATGGG 


GAATGGACTC 


CAGCCAGCCA 


GGAGGGCAGA 


GGGCTGGAGA 


GGCAGGGCCG 


48180 




GAGGTTCAGA 


CCCTCCGCTC 


TGACGTTGCG 


CCTGGTGAGG 


CCGGGAGGGG 


TGCCGCTTGC 


48240 




CTCTTCAGCC 


CTCACGCTCT 


TGTGGAAGTC 


GCGGAATTAC 


TGCAGGCGGA 


ACTTGCAGCA 


483O0 




CTGTGGGCGT 


CTTTTCCAGA 


GAAGGACGGA 


GTTGTGGGGC 


GGGAGGATAA 


GGCAAGGCCC 


48360 


15 


AGCCACTTCG 


CATCTTCGCC 


CCGCCAGCTC 


CTCGAGATGG 


G AT AT AC C AG 


GGTTGCTCTC 


48420 




CAACCCTCTC 


CGCAGGAGGG 


ACTGATGGAA 


ACGCCTGGGA 


AAGTAGCCCG 


GTACCCACAA 


48480 




AGGCTGTCTA 


CAAACAGAGT 


CTTACTGTCT 


TTCCCAGGTC 


TGTGCCATAG 


GGATTCTCGA 


48540 




AGAGAACAGC 


GTTGTGTCCC 


AGTGCACATG 


CTCGCATCGC 


TTACCAGGAG 


TGCCCGAGAC 


486O0 




CCTAAGATGT 


TCGGAGTGGT 


TTTTTCGCAC 


AGACCCGAAT 


AGCCTGCCCC 


TCAGCCACGC 


48660 


20 


TCTGTGCCCT 


TCTGAGAACA 


GGCTGATATG 


CCCAAGATAG 


TCCTGAATGG 


TGTGACCGTA 


' 48720 




GACTTCCCTT 


TCCAGCCCTA 


CAAATGCCAA 


CAGGAGTACA 


TGACCAAGGT 


CCTGGAATGT 


48780 




CTGCAGCAGG 


TAGAGCACAG 


GCCCCGAGGA 


AAGGACTGCG 


GGTGGGTGGA 


GCTTCAGCCA 


48840 




GGACGGGGTG 


TGCTTCCCTC 


TCCCGGCCCA 


TTCCAGCCAG 


GCCCCTCCGG 


GCCAGAGGCA 


489O0 




GCGTCTGTCA 


TAAAAAGGGC 


TGGTGTTCCA 


GGTGGGGTCA 


GAGAGAGGAT 


TGACAAGTAA 


48960 


25 


■ AAACGATCGT 


CCTTTGAAGG 


GGGCCGGCCC 


CTCCACACCT 


GTGGGTATTT 


CTCATCAGGC 


49020 




GGGACGAGAG 


ACTGAGAAAA 


TGAATAAGAC 


ACAGAGACAA 


AGTATAGAGA 


GAAAAGTGGG 


49080 




CCCAGGGGAC 


CGGCGCTCAG 


CATACAGAGG 


ACCTGCACCG 


GCACCAGTCT 


CTGAGTTTCC 


49140 




TCAGTATTCA 


TTAATTACTA 


TTTTCACTAT 


CTCAGCAAGA 


GGAATGCGGC 


AGGACAGCAA 


49200 




GGTGATAGTG 


GGGAGAAGGT 


CAGCAAGAAA 


ACGTGAGCAA 


AGGAATCTGG 


GTCACAAATA 


49260 


30 


AGTTCAAGGG 


AAGGTACTAT 


GCCTGGATGT 


GCACGTAGGC 


TAGTTTTATG 


CTTTTCTCCA 


493 20 




CCCAAACATC TCGGTGGAGT AAAGAGTAAC AGAGCAGCAT TGCTGCCAAT 


ATGTCTCGCC 


49380 




TCCTGCCACA 


GGGCGGCTTT 


TCTCCTATCT 


CAGAATTGAA 


CAAATGTACA 


ATCGGGTTTT 


49440 




ATACCGAAAC 


ATTCAGTTCC 


CAGGGGCAGG 


CAGGAGACAG 


TGGCCTTCCT 


CTATGTCGAC 


49500 




TGCAAGAGGC 


TTTCCTCTTT 


TACTAATCCT 


CAGCACAGAC 


CCTTCACGGG 


TGTTGGGCTG 


49560 
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GGGGACTGTC AGGTCTTTCC CATCCCACGA 
AACCTTGGGC AATACCCGGC TTTCCAGGGC 
GTGCCCCTGG TTTATCGAGA CTGGAGAATG 
"AAACATATTG TTAACAAGGC ATGTTCTGCA 
5 TACAACACAT GTTTCTGTGA GCTCAAGGCT 
GGGCAAAGCA ATTGTTCAGG GTACAGGTCA 
CATAGACACA GTAACAGTCT GATCTCTCTT 
AGGAGTGCCT TGGGGGCCTC TCTGAGGAGC 
TTCCATTAGA GGGGCAGCCG GTGGCCAGCC 
10 GCAGGGAGGT GGCAGTGGTG CAGGTTGCCT 
ACCCGGCTCA CACCCTTCCC CTCCCCGAGT 
TATGTAGGAC GTCAGATAGG ACAGCATAAA 
CTATTTTGAG ACTTGACACC TAATTAGTCA 
TGTTTTGGGT AACGTTCTGC AAATCGTTTG 
15 TAGGGCAAGA GTGGGACTTG CCTGTGGACT 
TGGCAGTGGC CTCTGCATCT GCAAAGAGCT 
CAGCTTGTGT GGCCTCGCCT CTTCCTGGCT 
AGAAGGTGAA TGGCATCCTG GAGAGCCCTA 
GCACCAGGCT GGCCTGGCGA GAACACCTCC 
20 AGAGGGCGCA AGGAGAGCTT TTCCCGGATC 
CTGCTGCTGG AGACCCCATA GGTGACCCTA 
GATGGTTGGC AAGGGATGGC GCTGAGGGTG 
TCAAGCAGAA CTCAAGGAGA ATTTTTTAGC 
AAACCTAGGG TTGGGCTTTT TTGCTGAATT 
25 TTTGATAAAC CAGTATCTGG GGTGTCAGAT 
AATGCCACCT GCCTTTGATA CGTGAGAACG 
TTGTGTCTGG TCAGCTTGCT ACACGGACAT 
CTCGCAACTC ACACAGGTCA TCAACGAGCT 
AGTTTACACC TGTCTCGGGG TCCTCAAGAG 
30 CCCCATGCTG TGCTGTGGTG GAGGGTGGTG 
GAGCGCTGGT GCCCAGGGGT GGGGTGCGGC 
AGGAAGCAGC TTTGCATCTG TGTGCTGACC 
GCTGTTGTAG CAGCTGTGCA GTAGGTCAGG 
GTAGGAGGAG GCCCTGCTGC TTGTCAGAGC 



bbCLAlAi 1 1 


twrtbrv^ irVi^-rl 


p&TP,pap,i\f^zx 




AbAbbTC L L 1 




bC Ab I bbA i b 


4 J OOU 


G C GATG AC TT 


I rALLAAbLA 


T^PTPPPTPT 

1AL ibLU lb 1 


A 07 A Pi 


CAGCTCTAbA 


T 1 PPP , T"T 1 7\ JAP 

IbbL 1 1AAAL 


P TTP 21 ttp p a 
b 1 lbn A lbbA 




GGGGCAAAGT 


TAbAGAIl AA 


P* A PP A rn/- ,r PT l A 

bAbLAibi 1 A 


o o U 


AAATGGAGTG 


TGTTATGTCT 


TCCCTTTCTA 


4 9 y I U 


TTCCCTACAG 


TCCTTGAGGG 


fTl/** TV f* 1 TV ^ TV V *T^f^ 

TGACAGACTT 




AGCTGATATT 


CACGGGTCAG 


Z - * TV /"* /T TV TV /— ' TV m 

GAGGAAGCAT 


50040 


TCACTTGGAA 


GGTCTTTGAA 


CCTLGGbbbl 


cni nn 


TCTCCTGGGT 


TCCTTGAGGT 


GCCCTCTTGT 


cni en 

DOlfaU 


TTCCTGCTCA 


GGTTCCCGTC 


TG Ab AGL TTb 


DL/^Z U 


TGTTTGGATC 


CAGAAACGCA 


GAACAGTTTC 


bU2oU 


TCTTACTATT 


TAAGCTGAAA 


T\ TV fT\ TV /"^/^i™ 1 1 f V/™» #^ 

AATAGTGTCG 


5034 0 


CTAATGGCGG 


CTGAGTTGCT 


TCACGCCCTT 


50400 


TCTCCGCGGT 


CCCACAGGGC 


TCTCGCCACC 


50460 


GCCCGCTGGC 


TGCCGAAGCT 


TGTGTCAGGG 


C A c o n 


TCCCCGTAAC 


CCTTGCTCCG 


AACTCCGTTC 


50580 


CGGGTACAGG 


GAAGACGCTG 


TGCCTGCTGT 


50640 


GAGACGGCAT 


CTCTGCCCGC 


AAGATTGCCG 


50700 


GGGCCTTGTC 


ATCCTGGGGC 


AACGCTGCTG 


507 60 


GTTCCCAGGC 


CTCTCCTGGC 


CTCCTGTGGG 


c r\ o *") n 


GGGTGGGCCC 


ATGGGGACTC 


CTGCCGTCTC 


bUooL) 


TGCTGTATAA 


TTTGTCGCCA 


m^~ , p , m/^' p* p , T , P ir P 

TCbTbbbTGT 


C Pi Q /I Pi 


AGGGCACGGC 


AG A I GLbLAb 


TTPapppa'TT 1 
I ILAbbLAl i 


ci nptPl 


TCTTGGCTGT 


L ibbAbbbCb 


bAbi, lAbbbb 


J XVJ o u 


TTGTCTGAGA 


ACCGTGACTT 


CTGTGCTTGC 


51120 


CCCAAAGATT 


ATTTACGCCT 


CCAGGACCCA 


51180 


TCGGAACACC 


TCCTACCGGT 


GGGTCAGACG 


51240 


AACCAGCTTG 


GCATGGTGCT 


GAGTCCACAG- 


51300 


GTCTTTCTAG 


ACGCTCCCCC 


GAAGTGTGCA 


51360 


CTGGGCTGCC 


TCCAATGCCC 


ATTACTTGTG 


51420 


TTGGGCGGGC 


GTCCTGAGCT 


CCTCGCAGGT 


51480 


GCTGGCCCCC 


AGTGCAGCTT 


TGCACATGAA 


51540 


CCAGCAGAGT 


CTTGGTGTTC 


TGTCGGGTTC 


51600 
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CTGTGGCCGG ACCAGTGGCA GGGTGCTGTG. GAAGCTGTCG AATCTCCTCC' CTCTGTCCAG 51660 

TACCCCCGCT CGTCTTCTAG CTCCCTCCTA CGCCCGGGCC ACGTTTCAGT TATGCTCACT 51720 

TCCTCTGACC GCCGAGGCTC CTGCGTGTCT CCATACAGCT CACGCTGCAG GGCCACGCTG 51780 

TGGGTGTTGG AGACAGCTCC TCCTCGACCC ACGGTGCTCT CTCCCACCAG GCCTAAGGTG 51340 

5 TGTGTGCTGG GCTCCCGGGA GCAGCTGTGC ATCCATCCTG AGGTGAAGAA ACAAGAGAGT 51900 

AACCATCTAC AGGTAGGCTC CTGGGCTCCC GCTCCGGCTC AGTGTCCGAC AGGCGAGTGC 51960 

TGCTGGGTGT CCAGAGCCCC AGGCTGCGCT CCCGCTGGGC TAGGGTTTGA AGTTCACTGG 52020 

GGGACTGCAG GGGAGGACCT GGTGGGGGTG GGGACTGGCT TCGGTCCTTT CTTGGCCGTG 52080 

CTTCAGCTGC GCACTCTGCC CTTCCTCCCA CAGATCCACT TGTGCCGTAA GAAGGTGGCA 52140 

10 AGTCGCTCCT GTCATTTCTA CAACAACGTA . GAAGGTACAA GCAGCTGGGT -GGGACCAGGG 52200 

TCGGGTTGGA GTGTGTGCAG CCTCTCAGGG TGGAGCTCAG TGGTGTCACA GCCTGGTTGT 52260 

GCTTGCCCGG TGGGGCGGCC AGTGCGGCCA TGTACCTGGG CCCTGTCTTC TGACTCGGGG 52320 

CCACCCATGT TAGACTTCTG TGTGGAAGAG. CTCACACAGT GGTCTGAGAC AGCCAGCCGG 52380 

CAAGACTGCC TCTGGCTGGT GCCTGGGGCC TTGGATTTTG GGAAGGCTCC CTCCATTTCC 5244 0 

15 TGATGAGAGG GTCTCCCTGC ACCTAACCTG CTGGTGCAAA CAGTAGGGGT TTTGCTGAAC 52500 

ACCGGCTTTC TCTTCGGGGA CTTTGTTGCT TGCCCAGCAG CAGGTGCTCC AGTGACCGGC 5256 0 

CCTCATACCA TCTTGGGAGG GTGTCCTGGA AGCCGTGTCT GGCCTCCCGC GACCCTGCCC 52620 

CGTGTGTCTT TTTCCTGTGC TGACCTTGCT GCGGAAAATT ATGGCCCTGA GTGTGACTCC 52680 

AGGCTGAGTC CTGTGGGTCC AACACGGGAT GCCTTGGGGC CTCTTCTGGA GACGGGATGT 52740 

20 GAGTGACAGG AGCCGGCCGG GGCAGCTTGC CCTGTGACTG CACGTGGCCA CAGCCTGTGA 52800 

GGGCCGGGGG TGCTTCTCCA CCCACGTGGC TGCCCCTCGG GTATGTCAAG GGCTTCTGGG 52860 

GCTCATCACG GGGTCCTAGA GACAGTGGCA GGGTGCACCC CCGTTGGCTG CCCTTACAGT 52.920 

TTCTGTGACC TGAGGGTGGC ATCTGTGCAG TCGGCGCGGT CTGTGCTTCT GTGGGATCAG 52 980 

GGTTCCCTCT GTTTCCTGCC TCAGTTGGGG ' CTCAAGCCTC AGGTGAGGTG. GCCCCGGAGC 53040 

25. ACTCAGAAGG CATCGGCGGT CCTGTGGGCT GCTTTCTGCA CTCACGTTTG CTGAGTGCTC 53100 

AGTGTGCCAG GACTGAGGAC CCTGAAGCTG CTCTTGTATT TAGGGCGGCG CTCCCCTGGC 53160 

AGAGACTGAG CCAGGTGGTC CCGCATGACC CACTACCAGG CGTTTCTGGG CCCTGGCCCT 53220 

TGGAGGGACA GGGTGGGCGG AACATGGGCC TGCAGGGAGG CTCCCGCTTA CTGGAGGCAT 53280 

GTGCTGTGTT GCTGGAGACA. TCCTCTGTGT TGCTTCTTGT TCGCTGTGGT TTTTGGTCTG 53340 

30 GTGGCACCAA GGACCCTCAG TCATCTTGAT GTGTGGTTGT CCAGGCCTTT TTGTTGGTCC 53400 

TAAGAAGGGG CTCTGCCTTT GTGCCCCCAG GTTCCCTGAC AGGAGCTGCC GGCTCGTCCC 53460 

GGTGATGCCT GCAGGACGTG ACTCTGGGAC GGGGGGTTGG GCAGATGTGC TGATGGAAAT 53520 

TCTCAAGCAG GCGTCATTTC CGAGGTCCTC ACCTGGATTT CCAGGACAGG AGTGCCTGCT 53580 

GGGTGTCCCC AGTCCCATGC AGCGGGGGTC CTTGGGATAG CATGGAACGC TGAGCATGGG 53640 
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CCTGGCCGGC CGTGGTCCTG GACAAGGGCA GTGCCCCGGT GGCTGCTGGG CCTGGGACCT 53700 

GGTGGGGACG CTGGGCCTGG TACCTGGTGG GGATGCTGGG CCTGGGACCT GGTGGGGAGG 53760 

CCTCTGACTG CCTCCTGGTG CTGCTTCCGT CTGTGTTAGG CCTCTGGGTA TTGGGGCCCC 53820 

CATCTGTCTC CTCCTCCAGG CCTGTGGACT CAGACCAGGA AGACACAGGC CAGCCCCTGC 53880 

5 CTGTCCCCCT TGGCTTGGGC TCTCACTGCC CGACCTGGCG GGAGGTTGCC TAGCCGTGAA 53940 

CCTTCGCACC CTGTCTGCCA CCGGACAGGC TGTGAGGGGG TGTCTGCAGC ACCTGCACCG 54000 

GCCTGAGCAT CTTCAGAGTG GGCTGCAGCT CCTGGAGGGG TCTGAGAGGA AGGGAGGCAG 54060 

GTATTTTGGG CGAATGAGGA GACAGCTGGA GAGCTGGCAC CCTTCCTGGC CTGCGTCCTG 54120 

TGAGGACTCT GGTTGGGGAC AGCAAGCTTG GGGTCAGCCT GGGGCAGAGC CTCTGGGACG 54180 

10 GCCCCGCCCC TCGTGCCCCT TCCCCTCGCA GCTCCTGTCC TCGCCCCGCC CTCAGCTCTC 54240 

CGCCAGGCAA GGTTTGGCAA GTGCCGCTGT GCGGCAGTGC CTGCTGATTG GCTGGTCTGT .54300 

TGCTATGGTG CTGCCCAGGG GTGTGCTTTT CCTCCCCTGC CTTCCCTGCT ATCCCTGGGA 543 60 

GTATCTGGGG TTGGGTCATC GCTGGTGTGT GTGAGTGTGT GTGTGTGTGT ATGTGCACGT 54420 

GTGCATATGT GTGCGCTTCT GGCCTCTGCA GCTGAGTCCT GGCCCTCGGG GGGCCTGGCA 54480 

15 CCTCCTGGGG ACAGGCACAA AGCAGCCATG ATGGAGTCGG GAGCTGGGGG AGGCCCCATT 54540 

GCCCCACGTG GCTGCCCTGT GACTCTGGGG TGCTTGTTAG AAGAGGTATC TGGTTCTGTC 54600 

TGTGTTTAAG CAACTCCCTA AGGAATTCTT GTGGTTCCAG TTTGGGGGGC CTGTACTGTA 54660 

GAGGCAAGGG AGGGGCAGGA CATCCCCCAG ACTCTGACTT CTGAAGCCTT TTCTGCCCGG 54720 

GGCCTCTCCG CCAGTACAGG CAGTGTCCTT TGCCAGGGCT GCCATGCTGC AGAGGGGAGT 547 80 

20 GGGCCACTGT TTAGCCCAGG AAAACCTGGC TCTCCCTTAG CTGGAAGTTC TGGGCCTGTT 54840 

GTGGTTGGCA GGGAAGCTGA GTGACGGTGC TAATCACAGG GGCACCTGCA GGGGTTTGTG 54900 

GGAGATGCCT CTGTGGGTTG GGGCGATAGG CTGAGGGGCT GTTCTTCCCT GCCCTGAGGA 54960. 

GGGCTGAGTG TAGCCGCCAC TCCTGTCCTG TCTTGGGCTG TCTCGGAGAG GATGCGTAGA 55020 

ACCCTCGGGA TCCTGCTGGC CTCCGTCTGG TCCACCCTGA ACCTCAGGCC TTCTGGGGGC 55080 

25 AGAGGAGGAT T£CCTCAGGA TCACTCGGGT GGGGGCCTCT CTTGGGCACC TGAGACCCTC 55140 

AGTGGGTGCT TTGTGGCGCG TTCACGGTTG GTGGGGGACG CCCAGCCCTG CCCGCCGTGT 55200 

AGGAGCCGTT CTGTCCTGGG CATCCCCCTG TGGTCTGGGA CTTAGTGGAC CCTGAGGGTG 55260 

TGTGTTTACC CCTGCCTCAC ACCTGCAGAA AAAAGCCTGG AGCAGGAGCT GGCCAGCCCC 55320 

ATCCTGGACA TTGAGGACTT GGTCAAGAGC GGAAGCAAGC ACAGGTGAGA CCCCTCAGTG 55380 

30 AGGCCACGAC CACTGTCCTT CCATGGCCCA GCTCTCCTGT GACCTGTGGA GGCCCGGATA 55440 

TATTTCTTCA CTTTTCTTTG TTCCTTTTTA AATTATGAAA CTAACCACCA TTCAGTACGA 55500 

AAAAGTTTAA GCAGCTCTGA GGAAGATAGA GTAAAAAATT GTCTCCCTCT TCCCTGGCCC 55560 

TCAGCCATCC CCGGTGGCCA CCGTGGAGTG TGGACGGAGC CCTGCAGGCC TGTGTCTGTG 55620 

CGGAAGCACG' CGCAGTTTTG TCTGCACAGA CTGTCCTGCA GTTGGCTGTT T.TCACTCAGC 55680 
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GTTGTGGGTA TAGCTTCCCA TGCTGGTGCT 


GGCAGCTCGG CCTTGTTCTT 


TTGAGGACAG 


55740 




CAGATGTCTC 


CTATGTCTAC 


CTCTTACAGC 


TTCAGAGATT 


CAAGTTATAA 


TAAAGCTCTT 
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CTTATATTGA 


GGGGGAAACC 


TCCCTCCCCC 
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AGTGCAGTGT 


CACAGTCTTG 
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V, itwL v. J. n 
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T P. T A P A CI A P A 
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11 1 l i in ii 
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/-t qi p rp p p p rp p rp 
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GTCGCCCAGG 


CTGGAGTGCA 




CTCGGCTCAC 


TGCAAGCTCC. 






10 


TTCACGCCAT 


TCTCCTGCCT 


p apj^ptppph 


AGTAGCTGGG 


ACTACAGGTG 




j u ^ o yj 




CGTCTGGCTA 


ATTTTCTGTA 


rnrnrrvprn t\ prp 7\ P 
1111 Inulnu 


APiAPGGGGTT 


TCACCGTGTT 


APPP APPATP 


JOJ4U 




GTCTCGATCT 


TCTGACCTTG 


TP ATPPPPPP 


APPTCGGCCT 


CCCAAAGTGC 


lubbnl 1AV_A 


JO'iUU 




GGAGTGAGCC 


ACCGCGCCCG 


ppppapptpt 


P.P TTTTTPTT 


AGTGGTTCTG 


PPTTPTPTTT 


^fi a fin 

JOlDU 




GTTTCTATCC 


AGGAATAGGG 


TfPPTTTT AP 
1 1 LjLj I 1 L 1 AV- 


TTTTPPATCG 


AGTTTTTAAA 


p apippappa 


^ fi^9 ft 


15 


TTTACATGGT 


CGGAAACTCA 


p c a p_p a ptpp 


PPATCCCTTG 


GTCGGAAACT 


P AP ATPPAPT 


JOJOU 




CCCCATCCCT 


TGGTCAGAAA 


v, 1 \- AUo 1 oOA 


PTPPPATPPA 


TCCCAGGCAG 


p arpTTPPP a. 






CCTGGGCCCT 


ACGTGCAGGA 




TTPPHGGTCA 


GAAGACATGG 




c r 7 n pi 
jo / UU 




GCACCGTCCC 


CTGCATGGGG 


tpptph rii^r. 


ATPTTPTPPT 


CTCTCCTTCC 




^ fi7 fin 




CCTTACTACC 


TGTCCCGGAA 




PAAP.PCGACA 


TCATATTCAT 


pppPTapa 


JOOZU 


20 


TACTTGTTGG 


ATGCCAAGGT 


rrr'pr'PTr' a p. 


TPPTPiTAGCT 


GACGACTCCT 




JOtJOU 




GGTGTCCCTG 


GGCTTGGGAA 


CTxCCVCTCCd 

r\Vj<_ 1 Vj J. v_ V. \j 


AGCCTTTGCT 


GCTTCAGGGC 


PTT1P ATP AP 


j O J *± \J 




CAGGCCTGGG 


TGGGAGGACT 


P APPTPTP/TP 


ACTGGGCAGG 


GGCTCAACCT 


CICICC AP AP AP 


57000 




ACTTGTGAGC 


AGCCCCAGGC 


PAP AGGTC AG 


TTTTCTGAGC 


AGTCTGGGAG 


PPPPPAPPPT 


57060 




GGTGGGAGTG 


AGGAGAGACC 


TCCAGGCTGT 


GGTCCATAGG 


CCAGTGCCCG 


PTPTTGATPP 


57120 


25 


TGACAGCTCA GGTTCTCTCC 


TTCACGTCAG 


GCCATGGGAG 


GCACCGAGAA 


CACAGGAAGC 


57180 




CCACTGACTC 


CCCTCTTCCC 


AGCGCGTGCC 


CGGCCCCACA 


CTCACTCCCC 


CTCCLAtiCAT 


c n o A n 




GTGCCCGGCT 


TCACACTCAC 


TCCCCTCTTC 


CCAGTGCATG 


CCCGGCCCCA 


CACTCACTCC 


57300 




CCCCACAGCA 


TGTGCCCGGC 


CTGACACTCA 


CTCCCCTCCT 


CCCAGTGTGT 


GCCCAGCCCC 


57360 




ACTCCCTTCC 


GCCCCGTGTG 


CCCAGCCCCA 


CGCTCACTCC 


CCCCGCCAGC 


ATGTGCCCGG 


57420 


30 


CCCCACACTC 


AACTCCCCTC 


CTCCCAGTGT 


GTGCCCGGCC 


CTGCTGCCCT 


CCTCCCCATG 


57480 




TGCCCTGCTT 


TTGTGCCCCA 


CACTTTTTAC 


TTAGTGCAGG 


TGGGATCACA 


CGCCACGGGT 


57540 




CAATGGTTTG 


TGTGTTCACG 


TGACGATGGC 


GTGGTGACGT 


TTCCAGATCC 


CGTCGTTGGT 


57600 




TCGCTCATTC 


T.CGGGGTGTA 


TATTTATTGA GAGCTCATCA TGCTGGGTGC 


TATTCCAGGC 


57660 




ATAGCAAGAC 


TGGCTTCACT 


CACATGGAGC 


TTTGATTCTA GTGGTGGGGA 


CAGGTGGACA 


57720 



-49- 



WO 01/42434 



PCT/US00/33065 



GCAAAAGAGT AAGCACGTGA GCTGACGATA CTGAAGGGAA ATAGAGCAGA GGGAGGAGGC 57780 

GGAGACCGAG CCAAGCGGGC CCAAGTGCGA TGTCGGCGGG AGGTGGGGAA TGCTGGTGGG 57340 

TCTGAGGGGA GCCTCAGCAG GTGCAGCAGA GCAAGGGAAG AGGTGAGTGG GGGCGGCTGG 5-7900 

GGGGCCGACT CCTGGGAAGC TGTAGCAGAA CCCCACAGAG AGCTGGTGAG GTTTGCCGTG 57960 

5 GTTGTGGGTG ACTCGGTGCT TTGAGCCCTG GCTGCCCCTG GGAACCATCT GGAGAGCTTC 58020 

TAACCCAACC AGGCCCCTCC CTGGGACAGT TATATCACAG. CTGGTAAGCC GAGTCTAACA 58080 

CTTTCACGGA AACGCAGAAG ATCTAAAACA GCAAGATGAC CGTGAAGAAG AACAGAGCTG 58140 

GAGGACTCAC CTCGCTGGTT TCAAGACTCC TCTAAAGCTG CAGGAGTGGA GGTGGAGATG 58200 

GCCCAGCTCA GGCACAGGCC TGCAGGCCAT GGAGAAGGCA GCAAGCTCAA GCTGACCCAC 58260 

10 ACGCATGTGG TCATTGTTTT TTTTTTCAGT TGGAATCTCA CTCTGTCACC CAGGTTGGAG 58320 

TGCAGTGGCA CCATCTCGGC TCACTGCAGC CCCCGCCCCT AGGTTCTAGC GATTCTCCCA 58380 

CATCAGCCTC CCGAGTAGCT GGGATTACAG GCGTGCGCCA CCATGCCTGG CCCTTGGTGA 58440 

TTGTTTTTTG ACAAACATGC CAATTTAATT GAGAGAGGAA ATGAAGGTTG ATTTCTGGTT 58500 

TTCTGAAAAA ATGGTGCTAA GAACAGCTGG ATATCTGTTC GGAAAACAGT GAATCTTAAC 58560 

15 TCTTGTTTTA CCCTGTATAA ACCTAAATGT AAAAGCTAAA CTAAAAGTTA TAGAAAGGAA 5862 0 

CATGGGGGAG GTCTTTGCAA CTTTGGGGTA GGCAGAGATT TCTTAGTATG GATACACAAG 58680 

GCACTAGCCA TGAAGAAAAA CATTAAAATT TAGACTTCAC CAAAATTTAA AGCTTCAACT 58740 

CTGTGGAAGA GTTGAGAAAA TGAAAAAGCA GTTAAAGAAA GGGAGAAAAT ACTTCTTTCA 58800 

AAGGACTTAA AAAATTTTTT CAGCCCTCCT CTGATTTGAA AGGACCTTTG ACCAGAGTAT 588 6 0 

20 GTAAAATTCT CCCATAACTA AGCAAACAAC CCACTTAACC ACTGGGAAGG GATCTGGACA 58920 

GACGTTTCAC CAAGATGGGT GGAATGGCCA GTTAACCACT GGGAGAGCAT CCGGACAGAC 589 8 0 

GTTTCGCCAA GATGGGTGGA ATGGCCAGTT AACCACTGGG AGAGCATCCG GACAGACGTT 59040 

TCGCCAAGAT GGGTGGAATG GCCAGTTAAC CACTGGGAGA GCATCCGGAC AGACGTTTCG 59100 

CCAAGATGGG TGGAATGGCC AGTTAACCAC TGGGAGAGCA TCCGGACAGA CGTTTCGCCA 59160 

25 AGATGGGTGG AATGGCCAGT TAACCACTGG GAGAGCATCC GGACAGACGT TTCGCCAAGA 59220 

TGGGTGGAAT GGCCAGTTAA CCACTGGGAG AGCATCCGGA CAGACGTTTC GCCAAGATGG 59280 
GTGGAATGGC CAGTTAACCA CTGGGAGAGC ATCCGGACAG ACGTTTCGCC AAGATGGGTG 59340 
GAATGGCCAG TTAACCACTG GGAGAGCATC CGGACAGACG TTTCGCCAAG ATGGGTGGAA 59400 
TGGCCAGTTA ACCACTGGGA GAGCATCCGG ACAGACGTTT CGCCAAGATG GGTGGAATGG 59460 
30^ CCAGTTAACC ACTGGGAGAG CATCCGGACA GACGTTTCGC CAAGATGGGT GGAATGGCCA 5952 0 
GTTAACCACT GGGAGAGCAT CCGGACAGAC GTTTCACCAA GGTGGATGGA ATGACCAGTT 59580 
GAGCACATGG AAAGTCGCCC AGCATCTCCA GTCATAGGAG AAGGCAGATT AAAGCCACGG 59640 
GGAGCCGACA CTGTGGTCCC ACTGGCATGG CTGAAATTCA GAAGCCCTGA GTGTGGCATG 59.700 
AGGATGTGGA ACAGCTGGAT CTCATCCATC GCTGTGAAGT TGTGTAGCCA CTCCACAAAC 59760 
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GTGTGGCAAA CAGCCGAGCC GGGAGAAGGG AAGACGTGTT CAAAGATTCA TATGTGGCCA 59820 

• GGCTCAGTGG CTCACGCCTG TAATCCCAGA ACTTTAGGGG CCAAGGCTGG GGGATCGCTT 59880 

AAGCCCAGGA GTTTGAGACC AGCCTAGGCA ACATAGGGAG ACCCCATCTC AAAAAAAAAA 59940 

AAAAAGAAAA AAGAAAAGAC TTCAGTGTGC AGGTTTACCA GAGTTTTGTT TGCAGTTGCC 60000 

5 AAAACTGGGA AGCAGCCCGC GTGAGCCCAT CCACAGGTGA ATGGACAGAC CGTGGTACCC 60060 

GAACACTAAC AGCAGCCACG GGCGTGGACT GTGGTCACAC AGCAGCAGGG AGCCGATGAG 60120 

TCTCGGACAT GCTAACCCAG AGAGGCCCAT TGAGGAGGAC CTACTGTTTT TTGTGTTTTT 60180 

GTTTTTTGTT TTGAAATGGA GTCTCGCTCT GTGGTGCAGG CTGGAGTGCA GTGGTGTGGT 60240 

CTTGGCTCAC TGCAGCTTCC GCCTCTTGGG TTCAAACAGT TCTCCTGCCT CAGCCTTCCG 60300' 

10 AGTAGCTGGG ACTACAGGCA CCCGCCACCA CACCCGGCTA ATTTTTGTAT TTTCAGTAGA 60360 

GACGGCAGTT CGCCATGTTG GCCAGGCTGG TCCCAAACTC CTGACCTTGT CATCCACTCA 60420 

CTTTGGCCTC CCAAAGTGCT GAGGTTGCAG GCATGAACCA CCGCACCCGG CTGGACCTAC 60480 

TGTTT.TATTC CATTTATGTG ACACTCTATT AATAGAAAAG GCAGGGGTGG GGCTGGTGGT 60540 

TATATGGTGC ACATAACTGC CAGAACTCAG TACACTTAAA ATGAACATCT TAATGTGTGA 60600 

15 AATTTTTTTT TTTGAGACGG GGTCTTGCTC TGTCACCCAG GCTAGAGTGC AGTGGTGCGA 60660 

TCTCCACTCA CTGCAAGCTC TGCCTCCTGG GTTCACGCCA TTCTCCTGCC TCAGCCTCCC 60720 

GAGTAGCTGG GACTACAGGC GCCCGCCACC ACGCCTGGCT AATTTTTTTT TTTTTTTTGT 60780 

ATTTTTAGTA GAGACGGGGT TTCACAGTGT TCGCCAGGCT GGTCTCGATC TCCTGACCTC 60840 

GTGATCCGCC TGCCTCGGCC TCCGAAAGTG CTGGGCTTGG AGGCGTGAGC CACCATGCCC 60900 

20 GGCCAATGTG TGAAAATTTA AAAGTACCAA AGCTGGACCC CACCCCAGAT TGCTCCCATG 60960 

ACACTCTGTG GGTGGGACCT GGGAGTTGGG TTTTGTTTTG TTTTGTTTTG TTTTTGAGAT 61020 

GAAGTCTCAC TCTGTCGCCT AGGCTGGAGT GCAGTGACAC AATCTCGGCT CACATTAACC 61080 

TCTGCCTCCC AGATGAAAGC - GATTCTCCTG CCTCAGCCTT CTGAGTAGCT GGGATTACAG 61140 

GCACACACCA CCACCCCCTG CTAATTTTTG TAT.TTTTAGT AGAGACGGGG TTTTACCATG 61200 

25 TTGGCCAGGC TGGTCTTGAA CTCCTGACCT CGTGATCCGC CCGCCTCGGC CTCCCAAAGT 61260 

GCTGGGATTA CAGGCGTGAG CCACCGCGCC TGGCTGGGAG TTGGGTTTGT AAATCTCCCT 61320 

GAGTGGGGCT GGGGCAGGGA ACTGCTGGGT CTGGGTCTTC CTGGCTCCTC TGGTCTGTGG 61380 

CTTCCTGACT GCGGTGGCCG GGGGCTCCCA GGGCATCGTG GCCGTCTGTC TTGCTGAGCG 61440 

TGGCACGTGC CTTTCCATGC TGTGGAGGAG CGTCTCCCGG TATGGCGAAC TGCTGGTTAG 61500 

30 GGTGGGGCGG TGTTGCCAGG TCATCCAGGT CTGGCCTCTG CTCTCGACAT CGCCGGCGCT 61560 

GTTGCTCATC TGCGCTTGTG ATGTTCGATG CCTGCTGCAC ATGTCTTGGC TTCCCTCTTT 61620 

GCCGGCCTCT GTGAGCTCCA GCGCTGCGTC CCTTCTCTTC CTCCTGTAGA GCCGCAGAGC 61680 

ACACAACATT GACCTGAAGG GGACAGTCGT GATCTTTGAC GAAGCTCACA ACGTGGTGAG ■ 61740 
TCTCCGCTGG CCTCCTAAAC ACCTCCTATT GCTTCTGGCC TTTTTGTCAA GAGCCACGCA 61800- 
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AACCTTTCTG 


GAGGGGCTCT 
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GACTGGGGAC 


blooU 




TGAGCACACC 


AGGAGCTTCT GCCACCCCCT CCCGCCCTGA TCCGATGCCT CTGCTGGGGC 
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TGTGGTCTTG 
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62100 




GGCTTGAGAC 


TGAGGCAGGA 


GAATCGCTTG 


AACCTTGGAG 


GCAGAGGCTG 


PAnTPAPPPA 


62160 




GGATCACACC 


ACTGCATTCC 


AGCCTGGGTG 


ACAAAGCGGG 


ATTCTGTGTC 


AAAAAAAAAA 


62220 




ATGTTGACTG 


GGCGCGCTAG 


CTCATGCCTA 


TAATCCCAGC 


ACTTTGGGAG 


pptpapcitc;c; 
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GCGGATCACG 


AGGTCAAGAG 


ATCAAGACCA 


TCGTGGCCAA 


CATAGTGAAA 


pappptptpt 


62340 


10 


ACTAAAAATA 


CAAAAAAATT 


AGCTGGGCGT GGTGGCGTGT GCCTATAGTC 
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\~ LAVA. X i. V— 
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GGCAGGAGAA 
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CAAAAAGAAT 


AATTGGCAAT 


TCCAGTGAAA 


TAATTGTTTG 


TTTGTTTGTT 


PAPaParznpT 


62580 




CTCCTTCTGT 


CGTCCAGGCT 


GGAGTTCAGT 


GGTATGATCT 


TGGCCCACTG 


P AAPPTPPAP 


62640 


15 


CTCCTGGGCT 


CAAGCCATCC 
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TTTTGTATTT 


TTTGTAGAGG 
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GATCTGCCCA 
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TCGTTTCTAA 


ATATTGGTGT 
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GTCATGTTTG 


GACCTACTTG 


TGGCCTTTTA 
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2G 
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GCTGTCAGCC 


TCCTGTGCCT 


TCTGCACCCC 


CACCCCATTT 


CTGCTTTCTG 


63000 




GAACCCCCGA 


TCCTGTCCTG 


TTCTGTGGTG 


ATTCGGGTGT 


GCTTGGGCTC 


TAGGAGAAGA 
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TGTGTGAAGA 


ATCGGCATCC 
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CCTGGCTTCA 


GGACTGGACG 


63120 




TCATAGACCA 


GGTGCTGGAG 


GAGCAGACCA 


AGGCAGCGCA 


GCAGGGTGAG 


CCCCACCCGG. 
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AGTTCAGCGC 
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CGTTCATAGC 


CAGACTGCTT 
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25 


CCTGCGCTGC 
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c -i ~i n n 




GCGTTCATAG 


CCAGGCTGCT 


TGGTCCTGAG 


GCCCGTGCTA 


CTGCAGTGGG 


CAGCCTGCCC 


63360 




TGTGGCTGTG 


TGTGGTCGGC 


CTGGGCACCA 


TCTATTCAGG 


CTGGCACTGC 


AGGGCATCCG 


63420 




CTTCTCTCAG 


AGGCTTCTTG 


GGTGTGAATT 


CTTCAGGGTC 


CTGTAGCCTG 


TGGAAGGGCT 


63480 




GGTATTGTTC 


AGTAGTTCTG 


GTATTTTCCA 


AAGACCTATG 


TCTTCTCCCA 


GCCAGTATCA- 


63540 


30 


ACTTGGCCTC 


TACTGTGTAA 


AACTGGAAAA 


CTCTACTTTG 


TGAAGCTGAG 


TTGGGAGCAT 
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CGCTTGAGGC 


CAGGAGTTTG 


AGACCAGCCT 


GGGCAACATG 


GCGGAACCTC 


GCCCCTGCCA 


63660 




AAAAATTAGC 


CAGGTGTGGT 


GGTGTGCTCC 


TGTGGTCCAA 


GCTTTTCTGG 


AGGCCGAAGT 


63720 




GGGAGGCGTG 


CTTGAGCCTG 


GGAGGCAGAG 


CTTCCGGTGC 


CCCAGATGAC 


TCCACTGCAC 
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TCCAGCCTGG 


GCGGCAGAGT 


GAGGCCATCT 


CAAAAAAAAA 


AAAAAGGAAA 


ACTAAATATA 


63840 
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TTCACTGTAA GGGCATTTTG CATCTTTAAA TGACCCACAA ATCTGGCATG CATCAGCTGC 63900 

TCTGCCTGTA GGTTCCTTCC CAGTGTTTGT CCAGAGGTGT ATTTCCACAC AGCGCTAGTC 63 960 

ACGGCATATG TGGAAAACGT GGAAACCCTT CATGGATGTT GTCAGTTGGT CTATATTTTC 64020 

1 TTTCTTTTTT TTTTTTTTGA GATGGAGTTT CACTTTTGTT GCCCAGGCTG GAGTGCAATG 64 080 

5 GCGCGATGTT GGCTCACTGC AACCTCCGCC TCCTGGGTTC AAGCAATTCT CCTGCCTCAG 64140 

CCTCCCAAGT AGCTGGGATC ACAGGCGTGC ACCACCACGC CCAGCTAATT TTGTATTTTT 64200 

AGTAGAGATG GTTTCTCCGT GTTGGCCAGG CTGGTCTCGA ACTCCTGACC TCACGTGATC 64260 

CACCCGCTTC GGCCTCCCAA AGTGCTGGGA TTACAGGCGT GAGCCGCCAC GCCCGGCCTT 64320 

TGTCCATATT TTCTACATGG CTTCTGTAAA CAGCTGACTA GGAGTCTGTG TGAATATCTT 64380-. 

10 CATAGGTTCT GCTGTGACAC TACTTGCTCG TGAGCATCTC CAGGTGTAAA CAGCATCAGC 64440 

TTCCCGCATT TTCCTTTAAA ATCGCACATG TGGACGGACA CCACGGGGAC CCTGGACCCT 64500 

GGGGAGCCCC GTCCTCACCC TTCTCACCAG GATGGCTGCT TGGTAGAGAG TGAGTTTGCA 64560 

AAGTTGGCAT TTGTTTAGTA CAGAAGTTAT CAGGTGTTCT GGCTTTAGAA TCCCTTTATA 6462 0' 

TATATATATA TATACATATA TTTAAGTGAC AGGGTCTCAC TCTGTTGCCC AGGCTGGAAT 64680 ■ 

15 GTGGTGGTAC AATCAAAGTT CCCTGTAGCC TCGGCCTCCT GGGCTCATGG GATCTTCCCG 64740 

TCTCAGCGTC TTAAAGCGCC GGGACCACAG GTGTGCACCA CTGCCACCGG CTCTCAAGAT 64800 

TGCCACGCAG GGAGTTGCAG TGGGGGAAGG GGTTCCTGGG ACTTTGAACG CTCCACCTCC 64860 

■ CTCCTCTCCA CAGTCCCCCA ACCCCACCTC TCTAACGGGG TGGACGGCCG CCTCTTTCCA 64920 

TCCTTCGCTT GGCGCAGGGT GGGGAGAGTG ACAGGTCTCC TTCCCTCATC TCGGCAGCTG 64980 

20 CCATTTCATC GCTTACATAA CGTGGGAGAA ACATCCACCC ACCCCCAGGC CTGTGTGAAC 65 04 0 

ATCACCACGG GGCCTTCTCC ACTCTTCAGT TTTGTTAGTT ACTTGATGTG CAGGGCTTTT 65100 

TGTTGTAACT AGTGGGGGAC GTGTGGTGGG GTGGGCTTCT GCCATCTCAT TCAGGACCAG 65160 

AACTTCAGTT TTCATCCCTA TCTGTTCCCC CACCCCTTTG GAGATGGGGT CTCACTCTGT 6522 0 

CACCCAGGCT GGAGAGCGGT GGTGCCATCA CGGCTCACTG CAGCCTCCAC CTCCTGCAGC 65280 

25 CTCCACCTCT TGGGCTCAAG TGATCCTCCT GCCTCGGCCT CCCAAGCTCC TGGGACTACA 65340 

GGCGTGTGCC ACTGTGCTTG GCAGGGTCCA TTCTTTTCCT CACACTTTAT TTATTGAAGA 65400 

GCCCAGGCCG TTTACCCTGC AGAGTCGGAA TCTGTAGAGG AGGGGCAGCC ACACGAGTTC 65460 

CCCGGTTTAC TCTGAACTTA GGTGGCTTGA GGGCCCCAGT TAGACTGCGG CCACCGTTTG 65520 

CCGGGCTCCA GATGGGACGT CCTTTCTATC AGAAGGCTCA CAGTATCTCC TTTCCCGTTT 65580 

30 CTTCCCATGT GAACATTGTT GCTGCTGAAC ACCTGAATAT GTTAATCACT GGGGGCTTGC 65640 

AAGATGGCAG TGTGCTAATT CCATCATCTA GTCAGTTAGC AGGAATAACT TAGGACCACG 65700 

CCCTGCACCA TATCAGCTAT GTGGTGATCC CATTCACACA GGAAAGGTGG GACAAATGCT 65760 

GGGGGTGGGC CGGGTGTGCT GTCTCACACC TGTCATCCCA GCACTTTGGG AGGCCCAGGC 65820 

AGGCGGATGA CGAGGTCAGA GATTGAGACC ATCCTGGCCA ACACGGTGAA ACCCCGTCTC 65880 
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TACTAAAAAT ACAAAAAAAT TAGCCAGGTG TGGTGGTGCA TGCTTGTAAT CCCAGCTACT 65940 

TGGGAGGCTG AGGCAGGAGA ATCACTTGAA CCCAGGAGGC GGAGGTTGCA GTGAGCCGAG 66000 

ATCGCACCAT TGCACTCCAG CCTGGCAACA GAGCGAGACT CCGTCTCAAA AATCAATCAG 66060 

TCAATCAAGT GTCATCACTG AATGTTTGTG TGTGAACGTG GGGATTGGTC CTGCCCCATG 66120 

5 CTCCCTCCTG AATCTCACTC CTGACCTCAG TTGCTGCACC TTGAGGTGTT TTCTGTGGGC 66180 

TCTTGTGTCC TGACCCCGGC GGTTGTGGCC TCTGCTGTCT GGGAGTCAGG ATTTTTCACA 66240 

CTCATGTCCT GCTCCAGACC TGGAATCAGC CAAGTCTCCA AGAAGCCCTG CTTTCTTTTC 66300 

CTGCAAGACG. GTATTTCAAG ACCCGCCGTG CGGCAGCGGG TTGGTCATGG TTACTGGGTT 66360 

GGTCGTTGTG ACTGGGTGTT TTCGTGGAGA TACAGCCATA CGCACAGGTG TGTTCACAAA 66420 

10 TGTTAATTCT AAAGGTCAAA CACCCGGCCA GGCATAAGGG CTCAGCGGTA ATCCCAGCAC 66480 

TTTGGGAGAC CAAGACTGGT" GGATCACCTG AGGTCAGGAG TTTAAGACCA GCCTGAGCAA 66540 

CAGGGTGAAA CCCCATCTCT ACTAAAAATG CGAAAATTAG CCGGGCATGG TGGCGCACAC 66600 

CTATAGTCCC AGCTAGTCGG GAGACAGACA CGAGAATTGC TTGAACCTGG GACATGGAGG 66660 

TTGCAGTGAG CAGAGATGGC GCTGCTGCAC CCCTGCCTGG GTGACAGAGT GACACCCTGT 66720 

15 CTCAAAAATG AATAGATAAA TAAAGATAAA ACACCTGCTC CTCTTGGTGT CTCCAGTTTG 66780 

GATTTGGCCT GTGTAGCCTC TTCCTTCGCC TGTTGGTGGA TTTGGCCTGC ACGGATTCTG 66840 

TGTGGCCTCT TCCTTCCCCT GTTGGTGGAT TTGGCCTGCA CGGATTCTGT GTGGCCTCTT 66900 

CCTTCCCCTG TTGGTGGATT TGGCCTGCAC GGATTCTGTG TGGCCTCTTC CTTCCCCTGT 66960 

TGGTGGATTT GGCCTGCACG GATTCTGTGT GGCCTCTTCC TTCCCCTGTT GGTGGATTTG 67020 

20 GCCTGCACGG ATTCTGTGTG GCCTCTTCCT TCCCCTGTTG GTGGATTTGG CCTGCACGGA 67 080 

TTCTGTGTGG CCTCTTCCTT CCCATGTTGG TGGATTTGGC CTGCATGGAT TCTGTGTGGC 67140 
CTCTTCCTTT CCATGTTGGT GTCCTTTTTT CCATGCCAGG AATCCTGGTT CTCAAGGGCG 67200 
GGGTTGTTGG CACGAGCGTG ATGCAGACTG CCTTTGCTGC CTTTCTCTTG CCCAGGGCTG 67260 
■ AACATGGAGC TGGAAGACAT TGCAAAGCTG AAGAGTAAGT GTTGCCCTCC CCGCCTCCTT 67320 
25 GCAGCTGGGT GGGGCCTCCT CCTTGCGAGG AGGTGGGTGA CACCTCCTCG ACCCACAGTG 67380 
ATCCTGCTGC GCCTGGAGGG GGCCATCGAT GCTGTTGAGC TGCCTGGAGA CGACAGCGGT 67440 
GTCACCAAGC CAGGGAGGTG AGAGGCGGGG AGCCAGCCCC TTCACTGCAG GCCCAGCCTA 67500 
GAGCTAGAAA CGGGCCATGG TGCAGTCCTG GGCTGTCACA TCACGAGTGA GGCCTGTTTT 675 60 
CAGGCCTGTT TTCCCTTTTT GAGACCTGGG AGGAGCACCT GCTTTGCATG ATCTGGTTGC 67 620 
30 TGAGATGTTG AGAGGAGCAG CACACACTCC CACGGGACAG CACACAGCCC CCCACGGAAC 67680 
GGCACACACA CCCATGGAAC AGCACACACA CTCCCACGAA CAGCACACAC ACTCCCACGA 67740 
ACAGCACACA CACTCCCACG GAACAGCACA CACACCCACG GAACGGCACA CACACCCACG 67800 
GAAGAGCACA CACACTCCCA CGGAACAGCA CACACACCCA CGGAACGGCA CACACTCCGA 67860 
CGGAACAGCA CACTCTCCCA CGGAACAGCA CACTCTCCCA CGGAACAGCA CACACACTCC 67920 
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GCCCAGAGTC 


ACCTGGTTGG 


TGGTGGCAGA 


GCCACAGCCC 


/ 3^4 U 




AGCCAGGCCT 


CCCTGCTGGG 


ACACGCTCGT 


TTATGCCGAG 


GCCGTCAGCA 


CAGAGGCTGG 


/ D jUU 




ACAGTGAGGC 


ACGGCTCTGC 


CTGCTGCCTC 


CACGCAGCGC 


CTGGCCGGGC 


C AAGG G TG AG 


"7 C. "> C Pi 




GGTCACATCT 


GAAGGGGGCC 


CGGCTGGCCC 


TGTTGTCCGA 


AGCCCCTGGT 


Pr'C^PT'P 7i CT^H 

GL.W- 1 UAVjCL. 






CCGAGGCCCC 


ACGTGCCTTC 


TTGGCTTCCT 


GTGCTCCGTG 


GCGTCTTCGA 




/ 340U 


25 


CCGGGGACGC 


TGTGTGGATG GGGTCTGTGA .GTGTGCCCTC GGCTCCGTGT 








GTGGTTCTTG 


GGGTGTATCT 


GGCCCCACCC 


CCACTGCGTG 


GTGTCCAGGG 


TGGGGCTTCA 


75600 




- CGGCTGCAGC 


TGCGGGAGCT 


GCTGCCCCTG 


CCTTGTGCTC 


CAGTGGGGCC 


TTGCCTCTGG 


75660 




GCTTGGTTCG 


TCCCTCTCTG 


GAACATTCTT 


TCTCAGCTGC 


TGTCCGACCC 


ATGGTGGCAT 


75720 




GACGTGGCCC 


TGGCTGAAGC 


AGCCCTTGTG 


CGGTTGCTGT 


GGTTGGGTCT 


GCCTGGCCGA 


75780 


30 


GCCGGAAGGG 


AAGGGCTGGG 


AGGGCGTCAG 


GGTGGCGTGG 


CTTGACCCCC 


GCTCGGTGAT 


75840 




GGTCCTGCAG 


CAAGGCCTCT 


CCCAGCAGGA 


AGCGTCCATC 


CCGGGGGGAG 


GCCGGCGCCC 


75900 




CTCACGCAGT 


TGGGGTTGCG 


GGAGGCAGTG 


CGTGCCTGAG 


GCAGCCGGTG 


CACAGATTCC 


75960 




AAGGGCCTGG 


AATCTGTTTG 


TTCCATTGAC 


CTCTGATGTC 


ACTTGACTTC 


TCAGAAGCAG 


76020 




CCACTCCCTG 


CACTGGGCGT 


TTGTAGGAAA 


TGAGCTCCTG 


GAGGAGGGGG 


TGGGGAAGTT 


76080 
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CCCCCATTGC AGGGCACACT CAGCCCCAGG AAGGAAACGT GCCTCGTCCC TGCTGACTCC 76140 

GAATCGCAGT CAGAGTCGTT CTGCTTGTGC CGTGTTGAAT TCCCGGCATC CGGCATCCAG 76200 

ACTCAGCCTC CTCCCCAGGC CACGGCCGGC GTGGCCAGTC GGTCAAGCCC TTCTAGGAAC 76260 

TTCCTTTGAG CTGGCGCCCT TGTTCACTGC TGACGCCACT CAGAGGCTTG TGCACGTGTC 76320 

5 CTGCTTCCAG GCAGAGCTGG GAACTCGCAC CCCGTCTTCT GCACGCGGCC GTGGAATGTC 763 80 

GGGATGCCGG CGCTTCCTTC CCGTGTGCTC TTGGCGGGGT G.GGC.TTCTTG CCCTGAGCCG 76440 

CATGTCACAG TTTCTGCAGA AGTTTAGGGT TGGAGTGGGC TGACCTCTCT GCAGGTGTCC 76500 

CCAGCCTCTG CCTGGGGTCT GCCTCCTACT CCCAGGACCC CCTGTCCCCC AGAGGGGCCC 7 6560 

CAAGCTGGCA GGCTCACACT CAGGGCAGCC TCCTTTGTTC TGACTTCTGC ACAGTGGGCC 76620 

10 TGGGTGGCTG. CCCGCGGCTC GCTTGCTTGA TGCCAGTGGG TGGAGAGGGT GATGGGCAGA 76680 

GAGGCAGGTG GTCAGGCCCC CAGTCCCGTC CTCACACTCT GTGCCCTCTG CCGCCCCCCG 76740 

CCCCAGAGGG AAGGTGCTGA GCTACTGGTG CTTCAGTCCC GGCCACAGCA TGCACGAGCT 76800 

GGTCCGCCAG GGCGTCCGCT CCCTCATCCT TACCAGCGGC ACGCTGGCCC CGGTGTCCTC 76860 

CTTTGCTCTG GAGATGCAGA TGTACGGGCC ACCCCTGCCA GGGCCTGAGC ACCGGTGACA 76920 

15 CCTCTGACAT CAGCGGGGTG GAAGTGGTGG GGGTCCCCAT GAGCCGGGTG CTGGGGGTCT 76980 

CGGGCCTCGA GGGCTAAAGG GGTGCTGGTG CACTTCCCCA CTGTCTGCTC CCTCTGGCCA 77040 

CGCTCAGCCC TTTCCCAGTC TGCCTGGAGA ACCCACACAT CATCGACAAG CACCAGATCT 77100 

GGGTGGGGGT CGTCCCCAGA GGCCCCGATG GAGCCCAGTT GAGCTCCGCG TTTGACAGAC 77160 

GGTGAGGGCC TGTCCCTGGG CCCTGCTGGG GTGGGAGGTG GGGGAGCACT GAGGCCTGAG 77220 

20 GCCCTGAGCA GTGGCCTCTC CGGCTCTAGG TTTTCCGAGG AGTGCTTATC CTCCCTGGGG 77280 

AAGGCTCTGG GTGAGTGCCC TGAATGCCCC AGCTGTGCGC ATCCTGGATC CTGGACCCCT 77340 

GCTCCCAAGA GCTGGTAGGG ACCCCTGCAG ACATCCTGCC CCTGCCTTGA CCCCGGCCCC 77400 

TGCACTTCCA GGCAACATCG CCCGCGTGGT GCCCTATGGG CTCCTGATCT TCTTCCCTTC 77460 

CTATCCTGTC ATGGAGAAGA GCCTGGAGTT CTGGCGGGTG CGTCTCCCCT GTGTTCTGGG 77520 

25 CGGC3GTGGGT- GAGGGCAGGG CTGGAGCATG AAGCAGGCAG TGGTCACAGC TCCTGCTTGC 77580 

CCTCATCGGA TCGGCGGCGT GACCAGGGCT GCCGTGTCCC TGCCTCTTCC TCCCACAGGC. 77 640 

CCGCGACTTG GCGAGGAAGA TGGAGGCGCT GAAGCCGCTG TTTGTGGAGC CCAGGAGCAA 77700 

AGGCAGCTTC TCCGAGGTCG GCACTTGGCC GGGGCTCTGG GCCTGCTGCC CCCTCGTGCC 77760 

TCCCCTGCCT CTCACAGCTT CCCCAAGGCT GACCACTGGC CCTGACCATG GGCTCCGGCG 77820 

30 GCTCCCGCTG CCTCTTCAGG GCTCCTGCGT TTCCTTCCTG GCCCTGAGTG TTGCCTCTTA 77880 

TCTTACAAAG CCCCCAGCAC CGGGTGGGTG TGGTAACAGT GGCCCTCCTG TCTGAGTAGC 77940 

CCTAGTCGGC CACCCTGGCC CTGGGGTTCC CCGTGTTTTC TGGGAAGCAC TGAGCAGGCG 78000 

TGGGGTCAGC CTGGGATCCG TGCCAGGAAG AAGCTTCCAG AACCCGATTG GCCTTCCTGG" 78060 

CTAGGACGAT CCTTCATCTT GGAGCATGAG ACCTGGGTCT CCCTCATGGG GGAGGAAGGG 78120 
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GCTGGGGGGG GGCTCCAGGC TCAGCCTCAC CAACTTTCCT TCCAGACCAT CAGTGCTTAC 73180 

TATGCAAGGG TTGCCGCCCC TGGGTCCACC GGCGCCACCT TCCTGGCGGT CTGCCGGGGC 78240 

AAGGTGAGCT CTCCAGGGCC CTCTGCCCTG ACCTGGTTGC CTGTTCCCTG GTGGGTGCTT 78300 

ATGGCTCCCC AGCAGACTCT GGGCCCTGGG GGCTGCGCGG TCCCCTCCTT GGGTCCCACG 78360 

5 AGAGCGACTG CTGGCCCTGC TGGGAGCGTG TCCTGCTCTG GGCCTGGGCA GGCAGGATGG 78420 

GAGTTTCCTG GCCACAAGAG TTGGAGGTGG CGTCTGGGAG CTGTGGACCC CAAGTGGGGT 78480 

CCTGACCCAC AGATGGAGCT TCCTCCCACC CCTGGTTGGG GACGGAGCCT CGGGGAAGGT 7 8540 

GGCTGGGCTG GGTGTGGGCA CCAGGGAGAG GAGCCCCCAC GGCCCCAGGC AGCTCCCTGG 78600 

TGTGTCCCCT AGGCCAGCGA GGGGCTGGAC TTCTCAGACA CGAATGGCCG TGGTGTGATT 7 8660 

10 GTCACGGGCC TCCCGTACCC CCCACGCATG GACCCCCGGG TTGTCCTCAA GATGCAGTTC 78720 

CTGGATGAGA TGAAGGGCCA GGGTGGGGCT GGGGGCCAGG TGAGTTACAG CAGGGTGGGG 78780 

GTGGGGTAAG GCGGTCTGGT GACTGAGCCC CCGCCCCGTG GCCAAGGGAG CCCCCGTGAC 78840 

CGAGCCGCCT CGCCCCACAG TTCCTCTCTG GGCAGGAGTG GTACCGGCAG CAGGCGTCCA 78900 

GGGCTGTGAA CCAGGCCATC GGGCGAGTGA TCCGGCACCG CCAGGACTAC GGAGCTGTCT 78960 

15 TCCTCTGTGA CCACAGGTGC GTGCAGTCCG GTGGCAGGCG CGGCGCCAGG GGACACGCCC 79020 

ACACCCCACT GGGCCCCTGG ACTCTCCTTC CCCACATGAG GCCCCGTCTC CTCCAGAGCC 79080 

TCTCCGGCTA CTCGGGGTCA GCGTGGGGCC GCTGCAGCAG ATGAGGGTCT TCACTTCGGT 79140 

GAACTGAACC CTTGAAGCGG CTGTGGGCAG GGCAGCAGGG CTATGGCCAC CCCCCAGGTT 79200 

CGCCTTTGCC GACGCAAGAG CCCAACTGCC CTCCTGGGTG CGTCCCCACG TCAGGGTGTA 79260, 

20 TGACAACTTT GGCCATGTCA TCCGAGACGT GGCCCAGTTC TTCCGTGTTG CCGAGCGAAC 79320 

TGTGAGTTCC TGCCCAGGGA GGGGATGAGG GTGTTGTCCC CAGAGGAGCC AGAAATGGGT 79380 

CCACCCACCC CCATGGTTCT GCAGATGCCA GCGCCGGCCC CCCGGGCTAC AGCACCCAGT 79440 

GTGCGTGGAG AAGATGCTGT CAGCGAGGCC AAGTCGCCTG GCCCCTTCTT CTCCACCAGG 79500 

AAAGCTAAGA GTCTGGACCT GCATGTCCCC AGCCTGAAGC AGAGGTCCTC AGGTGCGGAC 79560 

25 GGGCAGGGCT GGGTGGGCGG TGTGGGGGTG GCGGAGC'GGG CGGCGTGGGG CGGGCAGCAC 7 962.0 

CAGGCGCCCA GGGCGGAGGC GACTCACCTG GCTTTGTGCG CTTCCCCTCC CACCTCCAAA 79630 

GGCTGCCTCT CCCTCCTAGG GCAGGGCCCC CACGGGCTGC AACCCTCCCC TACAGGCAGA 79740 

GAACGCCCCA GGCAAGGATG CCCCCCGAGG CTGAGACTCC CCCCAATAGC AGGGAGGACA 7 9800. 
CCCACAGGCA GGACCCCAAG TGCTGGGACT CTCCCCCAAG AGGGGCTT.TG CCACAGGCAG 79860 

30 GGACCCCAGC TGGGGCCCCC CGTGGGCTTC ACTGCGCACT CGGGTGCCCC TGCAGGGTCA 79920 
CCAGCTGCCG GGGACCCCGA GAGTAGCCTG TGTGTGGAGT ATGAGCAGGA GCCAGTTCCT 79980 
GCCCGGCAGA GGCCCAGGGG GCTGCTGGCC GCCCTGGAGC ACAGCGAACA GCGGGCGGGG 80040 
■ AGCCCTGGCG AGGAGCAGGT ACAGTTCCAG GGCCTTGGGA TGGACACAGA CCCTCTGTCT 80100 
CCTGAGGCCA ACCCGACCCC GCCCATCTGG CCTCAGGCAC CTCCCCACAC ACCCCTGTAA .80160 
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> 


ATCCCCTGCC 


TGGCAGGCAG 


GCGGGCAAGC 


GGGCGGGGGA 


TCCCAGCTGC 


CTGGCTGTCT 


80220 




GTGGGTCCTC 


CACCCCACCT 


CACCCACAGG 


CTGCTGGCTC 


CCAGGTGGTG 


CATGCCCTGG 


80280 




CCCTCCGCGG 


GTGCCCCCCA 


CATCACTTTG 


GTTCTCTGGC 


GGGTCAGCTT 


GGCTCAGTGC 


80340 




ACTCAAGGTC 


GGGTGCCCCT 


GCCACTGGCT 


GGGCTTGAGG 


CTGGCCTTTC 


TCCAGGAATG 


80400 


5 


TGCTGCGGGT . 


GGAACCCAGG 


TTCCTTCTTC 


CTTGGGGCCT TTTGCCCCAG AAGCCCATAA 


80460 




TTCCTCAGGC 


CAACCCGAAA 


TTTTCTCCCT 


GCTTCCTGCT 


GGGAGCCATT 


CCCCTCTTCC 


80520 




TGCCCATCCC 


TGCCCTTCAG 


GCCCCTGGAG 


TGAGCTCCAG 


GTGCAGGCAC 


CAGGGACCTG 


80580 




TGTCCCCTTC 


CTGCCAGCCC 


CTCGCTGTGG 


TCGGACTGTC 


TTCCCTGGAC 


CTGCTCTTAC 


80640 




AAGTCACCAC 


CTGCGAGCCT 


CATGAGCCGC 


TGGTGTGACT 


TGGACAGGAC 


CAAGTTGTGG 


80700 


10 


CACTGTCACC 


GGGGTGTGCT 


GTGCCCCCCT 


CCCCCGACCT 


CCATCTTGGC 


TCAGGGCTCC 


80760 




TTGGGACCAT CTTCCCTGTG CGTCCAGGTG CTTTGGGACC 


CCAGAGTGTG 


TGGTTGGGGT 


80820 




CTGTGTGTGG 


TTGTGAGCTG 


TGTCCTCCTC 


AGGCCCACAG 


CTGCTCCACC 


CTGTCCCTCC 


80880 




TGTCTGAGAA 


GAGGCCGGCA 


GAAGAACCGC 


GAGGAGGGAG 


GAAGAAGATC 


CGGCTGGTCA 


80940 




GCCACCCGGT 


GCGTGAGCTG 


TCCCTGCACC 


TGTGCCGACC 


ACCATAGACA 


CGCATGGGAA 


81000 


15 


CGCAGCCGTG 


GGTGCCCCCA 


GCCACGGCTG 


GTCCCGATGG 


GACCAGGGAA 


TCCACCCCCA 


81060 




GGAGCTGATG 


TCCAGGGCAG 


CTGTGATGCT 


GACGGCCAGG 


GGCTCAAGTG 


TGTGGTTTCT 


81120 




TCTGCAGGGG 


GCTCATGAGT 


CCCAGCTGGA 


ATCAGGCCCC 


ACCCTTGGGC 


AGGTTTGGCA 


81180 




TGGGGCCTGC 


AGCACTGGGC 


TTGGCCCTGG 


CATTTCCCTC 


AAGTGTGGAT 


GCACACCTGC 


81240 




CTCATGTGAG 


GGACACAGCC 


CATTCCTAGC 


CTTGGATCAA 


AGAACGGAGT 


TATAGCCGGA 


81300 


20 


GCCAGGAAGC 


CCCCTGCCTG- 


CTGGAAAACC 


CCAAGTGTGG 


CGGCCTTTGT 


CCATGTCCCT 


81360 




TGGCTTCTGG GAAGAACTGG GTGGTGCCCA 


GGCAGGGCTG 


GTGCCATCAG 


GAAGTGGGTG 


81420 




GCTGCTGAGG 


GGCCTGGGCT 


GGCGAGGGCC 


TGGGTGGGGA 


GTGCCTGGGC 


CGCCCCTGCC 


81480 




TTGGTTTCCA 


CGTTTCCGTG 


TTGGTCTGGG 


GTGTGTAGAG 


AGATGGGCAC 


TGCTGATCCG 


81540 




GAAGCCCCTC 


CTTGTGCGCT 


GCCATCCTGG 


GAGCCTCAGC 


CGCATCCGCT 


GTGGGGCAGG 


81600 


25 


GGGCTTGAGG 


GAGGAGGAGA 


GAGACGGGCC 


ATGCAGGACC 


CCTGGCTTGA 


GGCAGAGCCA 


81660 




ATCTACCCTT 


TGCCCATTCA 


CTGCTCTCAG 


TTCCCTGCCA 


. GCCTCTCACT 


GTGTGACCTC 


81720 




AGACGGGCCC 


AGCCCCACAG 


CTTTCTTCCC 


GCAGCCCCTC 


CCTATGTCCA 


TCCAGCCAGC 


81780 




CAGTTTCTCA GGCAGCAGCC 


CCACCTCGGC 


AGTCACTGTC 


CCAGGGAACG 


CTCAATGTTC 


81840 




CAAGGAAGGC 


TCTGCAGCCC 


CAGGGACCAG 


ATGATGAGGC 


TGGCCCTGAT 


GGAGCCTCGG 


81900 


30 


GCCTGTGTCC 


TGCAGGAGGA 


GCCCGTGGCT 


GGTGCACAGA 


CGGACAGGGC 


CAAGCTCTTC 


81960 




ATGGTGGCCG 


TGAAGCAGGA 


GTTGAGCCAA 


GCCAACTTTG 


CCACCTTCAC 


CCAGGCCCTG 


82020 




CAGGACTACA 


AGGGTTCCGA 


TGACTTCGCC 


GCCCTGGCCG 


CCTGTCTCGG 


CCCCCTCTTT 


82080 




GCTGAGGACC 


CCAAGAAGCA 


CAACCTGCTC 


CAAGGTGCCC 


TGGCTTGCAG 


AGGGCACCCA 


82140 




CCCTGAGGGC 


AGTGCTGCGG 


CCGCGTGTGG 


GGTGGGGGCC 


ATCTGGGTCC 


AAGGTGGTCT 


82200 
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CTGTTCTCTA 


G AG AAAAAll LLL AuA I Lvj L? 


CAP AP^APPPP 


CCTTCCTCTA 


CAGGCTTCTA 


82260 




CCAGTTTGTG 


CGGCCCCACC ATAALLALLA 


rTTTr a rzcz a cz 


t^TrTHTATCr 

\J 4. \— i O i -Tl I \*V, 




82320 




ACGAGGCTGT 


GGCTATCGGC CTLALLALAL 


p a t t c r p p cz a 

Lnl XLLLL-^A 




CAPAGPPnCT 


82380 




CCTGGACCCC 


ACTGGTAAAT GGGGCllLAL 


L I LjoLjALL-V, 1 


pAnAPTPfTCi 




82440 


5 


AGTGTGGGCC 


AGAGTCCTGG GCTGCTTGGG 


L 1 LLLL A 1 LL 


tp pp. pp c c tcz 


PTTHPPPPPP. 


son 




CCTCTCTGTT 


CCCCTATGGG AGTGATGGGG 


LLLTLLALL 1 


L L ALL AL L AVj 


P APP APP APP 
LAL L AvaL AH— 


OiJOU 




ACCACCTCCA 


CCTTCACCAC CACCACCTCC 


AL L AL L AL LA 


LL lLLaLLAL 


ptpp arrTrP 
L 1 LLALL 1 LL 


0 <5 0 ^ U 




ACCACCTCCA 


CCACCTCCAC CACCTCCACC 


ALLALLALLA 


LL1LLALLAL 


PAPP APP APP 
LALLALLALL 






ACCACCTCCA 


CCACCACCAC CACCACCACC 


ACLTLLALL 1 


LLALLALL IV- 


LnLLnLLnLL 


q 0 7 APi 


10 


TCCACCTCCA 


CCACCACCAC CACCTCCACC 


TL CALL ALL 1 


LLALL ILLAL 


PTPP APP APP 


OiiOUU 




ACCACCTCCA 


CCACCACCAC CACCACCTCC 


ACLTLLALL A 


pp app a pp at* 
UCAbLAuLA 1 


PaPTTPTTPP 
LAL 1 ikj 1 1 \jKj 


Q OQ fif) 
OiODU 




GGAGACCCTG 


TGCAACTCCA TGCACAGCCC 


TGTCCCTGCC 


TiTAPPPPPP a 
A 1 ALtL. L L L LjA 


PPPPT A APP A 


O <i J <d u 




CAGCCCTGTC 


CAACTGCCAC ACGTCCCCTG 


CCTCCCATGC 


A ILL ILL ILL 


HjLL i L AAL 1 


q 9 q a ft 




GCACACGCCA 


GGGTCCTAGG GTCCTAGACC 


CCTGTCCTCC 


L 1 L 1 1 IL lut 


PTPTPTTTPP 


Q \ C\AC\ 


15 


GGTGGAGTCC 


AAGTCTCCAG AGGCGGAAGC 


ATCTGTGTTC 


L 1L1L1 lnni 


pa apappppp 
VjAALAVjLL-LL 


ojiuu 




TACAGAGTTC 


CCCTAGTTCA CCCAGGGGGG 


AACCTAGCCT 


Li 1LLLALLA 


cccc apitpp 
LLLLALA1 LL- 


Oil fTPl 




CTTCTGGGCT 


TGGTACTCAC TGGGATATCC 


TCATGCCTGC 


ALLLALLL 1A 


LbLL 1L ILAL 






CTCCTGAGTG 


GGGCTTTGGC CTGCCCGCCA 


CTGTTCCAGC 


LLLLA1LLAL 


L ALtLjL, 1 VaL 1 Lj 






TCTCCTCTGA 


TGCCCCCAGC ACCCAGGCGT 


GTACCTGCCT 


LLL1 1 I 1LLL 


rr 1 pr* tp p t p t 
ULLL 1 LL I L, I 




20 


GAGGTGGGTG AGGCCTGGCC TCCCTAGCCA, GCCCTGCCCC 


CCCACCCCAG 


pp A Apfprnmpp 

LvjAAL, 111 L.L 


0 j-ft u u 




AGATGCTCCC 


GACCAGCTTT GTGGCTCTAC 


ATCTCTTCAT 


CAGGAAGAAC 


LL*L,L3L_L,LjL3A 1 


OJ40U 




CCCAAGCTGA 


CCGTGTCCAC GGCTGCAGCC 


CAGCAGCTGG 


ACCCCCAAGA 


P.P APPTP A AP 


OJJiU 




CAGGGCAGGC 


CCCACCTGTC GCCCAGGCCA 


CCCCCAACAG 


GTAGCTGACT 


PPTPA APPP.T 


83 580 




GTGCAGCCTA 


CGACTTGGTG GGTCCCTCAG 


TGGCTTCACG 


AGGCTAACTC 


TTP,AP,TPTPP, 


83640 


25 


CCGGGGCTGC 


CCCTGTGGGG AGCCATCTCA 


TGGTGGGGAC 


TGCTCCCGGT 


TCTGCACCCC 


83700 




GCAGTTGTCC 


TGAGCAGCTC TCCAGGAGTT 


CCTGGAGGAA 


GGGCGGGCAG 


gll lltllla 


O T *7 C Pi 




CTCTCAGTCC 


TCCACCCCAG CGCCACTCTG 


AGCCATGCTA 


CTCCCACACC 


AGGAGACCCT 


83820 




GGCAGCCAAC 


CACAGTGGGG GTCTGGAGTG CCCAGAGCAG GGAAGCAGGG 


CCAGCACGCC 


83880 




GTGAGCGCCT 


ACCTGGCTGA TGCCCGCAGG 


GCCCTGGGGT 


CCGCGGGCTG 


TAGCCAACTC 


83940 


30 


TTGGCAGCGC 


TGACAGCCTA TAAGCAAGAC 


GACGACCTCG 


ACAAGGTGCT 


GGCTGTGTTG 


84000 




GCCGCCCTGA CCACTGCAAA GCCAGAGGAC 


TTCCCCCTGC 


TGCACAGCAA 


GTGGCCCTGG 


84060 




CGTGGGGAAC 


AGCCGGTGGG GTGGGGGGCA 


GGGGACAAAA 


TGGGGGCTGT 


GCCGGGTCTG 


84120 




ATTGAAGCTC CCCGCAGGGT TCAGCATGTT TGTGCGTCCA CACCACAAGC 


AGCGCTTCTC 


84180 




ACAGACGTGC ACAGACCTGA CCGGCCGGCC CTACCCGGGC 


ATGGAGCCAC 


CGGGACCCCA 


84240 
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GGAGGAGAGG 


CTTGCCGTGC 


CTCCTGTGCT 


TACCCACAGG 


GCTCCCCAAC 


CAGGTAGGGC 


84300 




ACCTGCCTGG 


CTGCTCCTGG 


CAGCGCCCCA 


ACCGCACGCA 


GCCCTGGGAG 


TGAGCAGCAA 


84360 




AGCCCCAGGC 


CCCCCTCAGA CTCAAGTCTC TGTCTCCAGG CCCCTCACGG TCCGAGAAGA 


84420 




CCGGGAAGAC 


CCAGAGCAAG 


ATCTCbTCL v 


1 L.L. 1 1 AVjAL-A 






84480 


5 


GGGCGGGCGG 


TGAGGATGCA 


GGTCCCAGLL 




A.L.V. ItLLLAL 




84540 




CATCTGAGTG 


GGGTGAGCCT 


CATGGGAGAG 




GL AbL. AGbi-L 


fi. PPPP PTTP 
AL-o*jjGAGG IG 


84600 




CGGGCGGGCC 


CCTCTCAGCA 


GGCTGTGTGT 


GCCAGGGLTb 


nrpprrp afar 
1 bGbGL AG AG 


GAGG 1 GG 1 GG 


84660 




CCTTCCAGTG 


CCCTGCCTGT 


GACTTCCAGC 


GCTGCCAAGb 


C FGL 1 GGG AA 


p*^ , ^ , ^ , appttp 
UGGGAGL11G . 


84720 




AGGTTGGTGC CTGGCCACTA 


CAGTTCCTGC 


TGGGTGTAGL 




GGGG 1 G AGGG 


84780 


10 


GGAAAGGGCA 


GGCCCTTGTC 


CTGGTGGCAA 


CGCLTGGLAb 


ALG Ibl AG 


IGGo4~LGGl i 


84840 




GTCTCACAGG 


CCTCTAGGAT 


GTGCCCAGCC 


TGCCACACCG 


G LTG L AGGAA 


GG AGAGG G 1 L 


84900 




ATGCAGGTCT 


TCTGGCCAGA 


GCCCCAGTGA 


GTGCCCACGG 


AGGCCCCCAG 


f s AP , AP"P"P , A AP 1 " 

GAGAGGLAAG 


84960 




GTGGCTTGAT 


CACCTGCCTG 


TCCAGCTCTG 


GTGGGCCAAG 


AACCCACCCA 


SpliPfl A T APP 

AG AGAA i AGG 


85020 




CCAGCCCATG 


CCAGCCGGCT 


TGGCCCGCTG 


CAGGCCTCAG" 


GCAGGCGGGG 


nnp7\ mpPTTP 

CCGATGGTTG 


85080 


15 


GTCCCTGCGG 


TGGGACCGGA 


TCTGGGCCTG 


CCTCTGAGAA 


GCCCTGAGCT 


a P i r ,r PT , r , r*p , p*T i 
ALU 1 1 bbbo I 


85140 




CTGGGGTGGG 


TTTCTGGGAA 


AGTGCTTCCC 


CAGAACTTCC 


CTGGCTCCTG 


rrrTPTP apt 
GGU HjI GAG 1 


85200 




GGTGCCACAG 


GGGCACCCCA 


GCTGAGCCCC 


TCACCGGGAA 


GGAGGAGACC 


GGGGTGGGGA 


85260 




CGTGTCCACT 


TTTAATCAGG 


GGACAGGGCT 


CTCTAATAAA 


GCTGCTGGCA 


C r V( r *rTT' APPJ 

G 1 GGGGAGGA 


85320 




CGGTGTCTTC 


GTGGCCTGGG 


CTTGGTGGTG 


GGAGTTGAGG 


GAGAGGGAGT 


T'OP^P' AP7\ ppp 

1 GGG AGAGG L 


85330 


20 


. CCCTCCCAGC 


CTGCCATGTG 


ACACTGTACT 


TCCTCCACGG 


TGGGCTCAGC 




85440 




TCCTCACAGC 


CGCAGCCAAG 


CTGCAGTTGG 


TAGGGGATCC 


ACCGACACAC 




85500 




GGGCTGGTCT 


CTGGGTTGGG 


AGCTGLCCLA 


GGTGCTGAGG 


AGGGCAGCTC 


CCTGGCTGGT 


85560 




GAGGCCCCTC 


CCAGAACCAC 


CCTTGGACTG 


AGCTCTGGGG AGGGATGGTA 


CCAGGTGGGT 


85620 




GAGGGGGGCT 


GCCTGGGGAG 


GGAGGGGTTC 


CTATGGGGCG 


TGGCGAGGCT 


GGCCCAGCCC 


85680 


25 


TCTCCCCGCC 


CATATATGTA 


GGGCAGCAGC 


AGGATGGGCT 


TCTGGACTTG 


GGCGGCCCCT 


9 j-7 4 0 




CCGCAGGCGG 


ACCGGGGGCA 


AAGGAGGTGG 


CATGTCGGTC 


AGGCACAGCA 


GGGTCCTGTG 


85800 




TCCGCGCTGA 


GCCGCGCTCT 


CCCTGCTCCA 


GCAAGGACCA 


TGAGGGCGCT 


GGAGGGGCCA 


85860 




GGCCTGTCGC TGCTGTGCCT 


GGTGTTGGCG 


CTGCCTGCCC 


TGCTGCCGGT 


GCCGGCTGTA 


85920 




CGCGGAGTGG 


CAGAAACACC 


GACCTACCCC 


TGGCGGGACG 


CAGAGACAGG 


GGAGCGGCTG 


85980 


30 


GTGTGTGCCC 


AGTGCCCCCC 


AGGCACCTTT 


GTGCAGCGGC 


CGTGCCGCCG 


AGACAGCCCC 


86040 




ACGACGTGTG 


GCCCGTGTCC 


ACCGCGCCAC 


TACACGCAGT 


TCTGGAACTA 


CCTGGAGCGC 


86100 




TGCCGCTACT 


GCAACGTCCT 


CTGCGGGGAG 


CGTGAGGAGG 


AGGCACGGGC 


TTGCCACGCC 


86160 




ACCCACAACC 


GCGCCTGCCG 


CTGCCGCACC 


GGCTTCTTCG 


CGCACGCTGG TTTCTGCTTG 


86220 




GAGCACGCAT 


CGTGTCCACC 


TGGTGCCGGC 


GTGATTGCCC 


CGGGTGAGAG 


CTGGGCGAGG 


86280 
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GGAGGGGCCC 


CCAGGAGTGG 


T G GC L Cj U A tj <j 


T* H TT* flT* A C P C 






O O J4 U 




CTTGCACCCT 


GAGCTAGGAC 


ACCAGTTCCC 


CTGACCCTGT 


TCTTCCCTCC 




OOHUU 




CACCCCCAGC 


CAGAACACGC 


AGTGCCAGCC 


GTGCCCCCCA 


GGCACCTTCT 


c arrr t\ pp7\p 


q c a ezn 




TTCCAGCTCA 


GAGCAGTGCC 


AGCCCCACCG 


CAACTGCACG 


GCCCTGGGCC 


TGGCCCTCAA 




5 


TGTGCCAGGC 


TCTTCCTCCC 


ATGACACGCT 


GTGCACCAGC 


TGCACTGGCT 


TCCCCCTCAG 


gccon 
OOJOU 




CACCAGGGTA 


CCAGGTGAGC 


CAGAGGCCTG 


AGGGGGCAGC 


ACACTGCAGG 


CCAGGCCCAC 


Q C C A f\ 

o bb4U 




TTGTGCCCTC 


ACTCCTGCCC 


CTGCACGTGC 


ATCTAGCCTG 


AGGCATGCCA 


GCTGGCTCTG 


ob / UU 




GGAAGGGGCC 


ACAGTGGATT 


TGAGGGGTCA 


GGGGTCCCTC 


CACTAGATCC 


CCACCAAGTC 


ob / bU 




TGCCCTCTCA 


GGGGTGGCTG 


AGAATTTGGA 


TCTGAGCCAG 


GGCACAGCCT" 


CCCCTGGGGA 


o bo^U 


10 


GCTCTGGGAA 


AGTGGGCAGC AATCTCCTAA CTGCCCGAGG GGAAGGTGGC 


TGGCTCCTCT 


obooU 




GACACGGAGA 


AACCGAGGCC 


TGATGGTAAC 


TCTCCTAACT 


GCCTGAGAGG 


AAGGTGGCTG 


Q C Q A f\ 




CCTCCTCTGA 


CATGGGGAAA 


CCGAGGCCCA 


ATGTTAACCA CTGTTGAGAA GTCACAGGGG 


o / UUU 




GAAGTGACCC 


CCTTAACATC 


AAGTCAGGTC 


CGGTCCATCT 


GCAGGTCCCA 


ACTCGCCCCT 


87 060 




TCCGATGGCC 


CAGGAGCCCC 


AAGCCCTTGC 


CTGGGCCCCC 


TTGCCTCTTG 


CAGCCAAGGT 


O / liU 


15 


' CCGAGTGGCC 


ACTCCTGCCC 


CCTAGGCCTT 


TGCTCCAGCT 


CTCTGACCGA 


AGGCTCCTGC 


o / lb U 




CCCTTCTCCA 


GTCCCCATCG 


TTGCACTGCC 


CTCTCCAGCA CGGCTCACTG 


CACAGGGATT 


8/^40 




TCTCTCTCCT 


GCAAACCCCC 


CGAGTGGGGC 


CCAGAAAGCA GGGTACCTGG 


CAGCCCCCGC 


87300 




CAGTGTGTGT 


GGGTGAAATG 


ATCGGACCGC 


TGCCTCCCCA 


CCCCACTGCA 


GGAGCTGAGG 


87 3 60 




AGTGTGAGCG 


TGCCGTCATC 


GACTTTGTGG 


CTTTCCAGGA 


CATCTCCATC 


AAGAGGCTGC 


o /42 U 


20 


AGCGGCTGCT 


GCAGGCCCTC 


GAGGCCCCGG 


AGGGCTGGGG 


TCCGACACCA 


AGGGCGGGCC 


OT/I OA 

o / 4oU 




GCGCGGCCTT 


GCAGCTGAAG 


CTGCGTCGGC 


GGCTCACGGA 


GCTCCTGGGG 


GCGCAGGACG 


O / 04U 




GGGCGCTGCT 


GGTGCGGCTG 


CTGCAGGCGC 


TGCGCGTGGC 


CAGGATGCCC 


GGGCTGGAGC 


o / buu 




GGAGCGTCCG TGAGCGCTTC 


CTCCCTGTGC 


ACTGATCCTG 


GCCCCCTCTT 


ATTTATTCTA 


O/otJU 




CATCCTTGGC 


ACCCCACTTG 


CACTGAAAGA 


GGCTTTTTTT 


TAAATAGAAG 


AAATGAGGTT 


07 7 o n 


25 


TCTTAAAGCT 


TATTTTTATA 


AAGCTTTTTC 


ATAAAACTGG 


TTGTAGTTGC 


ACAGCTACTG 


077 on 




GGAGGGCAGC 


CGGGGACACC 


TGAGCCGCCC 


GCTGTGCCCA 


GATCCCTCAG 


GCTGCCTGCC 


87840 




ATCAGAACTG 


CTGCCCGGGG 


CTTCCCCTAC 


CTCAGACAGA 


CCCTCCCTGG 


GAGGATCAGT 


87900 




GGGGAGTGCC ACCTCTGCCC 


CCAGTGGCTG TGGCACGTGG CAGGGGCCCC TGAAGCTCAG 


87960 




CGAGGGTCAG 


GGCCTGGGAG 


GGTATCATTG 


CTGGAAGAAC 


AGGATGGGGC 


TCAGGCCAGC 


88020 


30 


CCTAGTCGCC 


GGGGCCCACA 


CTAACCCCCC 


ACTTATGAAT 


TCCTCCCACT 


CCCAACTCAC 


88080 




AGGGGATTTC 


CCGAGAGGGG 


ACCTGCCAAA 


GACCTCCTCC 


AGGCCTCCCA 


TGCTTCCCGG. 


88L40 




GAAGTGAAGC 


TTCTCCCCCT 


CTGGGGCAGG 


CTCTGAAGCC 


TCCCGATGCA 


CCCAGAGCAA 


88200 




CCAGGGGGCT 


GCACCAGCCA CTCGCCTCCC 


CAGCACGGCC 


AGGTTCCCGG 


GGCTGGAGGT 


88260 




CCCCCCCAGG 


TCCTGGGAAC 


CAACCTGCAG 


AACACACACA 


GGGTCCCCTG 


GAGAGGACGC 


88320 
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GGGGACTTCC AGGGCCCGAC TCCTGTGAGT CACAGCCCCG CAGCTGCTGC GCCACCCCCA 88330 
CCCTGACTCA TGCCCCTTCC CAGCAGCTCC TCCCAGGACC CCATGTCCTT CCCACATCCG 88.440 
CAGGAAGGGA GTGCCTGGAC TCTCCAGGCC CACCTGGGGA GCCCCTCACC TGCCCACCAG 88500 
. CCCCTGAGCA GCCCAGTAAC ACCATCACCG TGTCCAACAG CCAGGAGCCT CCACCCTCCA 88560 
5 GGAGGGAAGG GATGGACAGA GCCACACTCG CCGTCTTTAT TTTGCACTCA CCCTGGGTGA 88620 
CACTGGGCAG GCCGCTCCTG CCCACAGCCA GACTGAGGAA GAACACAGCA CTCGGCAGGC 88680 
CCAGTGGGGT CCGTGCAGGG AGGACCCCAG GACCAGCCTT ACTCCCGAGC AGGGGACACA 88740 
GGGCCCCACA GAGAACCCCT CCGGGAGGTT CTCTCCTGGC TGGGGGAGGG CTCTGGACCC 88800 
CCACAAACAC TCCCCAACTT GCGGGGCTGG GGCATAAAAA CAGCCACTCC CAGCAGGCCC 88860 
10 CCTCAGCTTT TTGCATCAGT CAGCTCCCTC CCGGGGGATT AGGGTGAGGT GAAGCCAGGC 8892 0 
CCAGGCGTGG GGTATAGGTC TTCCCCCGCA GGCCTCAGCC CTGTCCCGAG GCTGCATCAC 88980 
AATCCAGGGC CCCCGCTGGC CTTTGGGAAC ATGGCCTGGG TCTTCCTCAA GGCAAGATCA 89040 
GCCCCAGACC ACTTCCGGGG TCACGGGGTC ACAGGGCAGA AGCCAGATGG CAGCCATGGC 89100 
TGACGGGCCT CCTCCTCGAT GGGGCGGAGA CAGCCACGGG GTCTCCCGAG GGTCCCACAG 89160 
15 GGCTGTCCTC ATGCAGCCCA AGCCAGCCTG AGCACTGGAG CCCCAATTCC CAACCAGGTC 89220 
TCCCTCAGAC CCCCCAGAAA GGGCCTCGAA AGGCCGCCGC TGCGCCCTGT GGAAAGGCTG 89280 
CCGCTGCAGG GCCTGGGCCA GCCGGGCTGC CAGACTCCCC TCCAAAGCCT CCGGATGCCT 89340 
ACGCTTTTCC AGACATAGAG GAAAGTTTGT CTTCGAGAAA ACAAAGTAAA TAGAAGAACC 89400 
CCAAAGCAAA GCAAACCCAC CCCCCAGATC AGCAGCATGG GAGCCAACAG GAGGCCACTC 89460 
20 CTCCAGCACC AGGGGACCAG CCGTCCCGAC GGCAGCGCGG CTGCGCCTAC GTGATGTCCC 89520 
TCTGCCGCGG CGGCCGGTGC ACATTCCGCA CGACACACTT CACCATCCAC TCGATGCCCT" 89580 
CGCGCACCCC TTTGCTGTGA AGACAGCGGG TGTGAGGCGG GGGGTCTCGG TCCCCAAAGC 8964 0 
CCCCGCAGGT GCAGCCCCCA CTCACCCTGT GAGGGCCGAG CAGGCCTGGG TCAGGCAATC 89700- 
' • GCGCCTGCCG ATCTTGCTGG TGCAGTCGCT GAAGGCCGTC TTGATGTCAG GGATTGAGAG 89760 
25 GCACGTCTGG GGGAGGTAAG GCCGTGAGGA GCAGCCCCCA CGTCTGGCCC TGTCCTGCCT . 89820 
GTGGGCCCGG GACTCTCAGA AGGGCGTATG CCCTTCACCC CAGGGAAACA GCCAGAGCTC 89880 
CACCAGGGTC CCAGTGTCTC CCACAGAGAC CACAGCAGTG AGGACCCTGT GCTCAGCCCG 89940 
AGGCTGAACA TGGCTGGTAG TGCCTGAGAC AAACTAGACG TCCACACGGC TCCAAGGAGT 90000 
CCACCCCCCA TCCCCTCCCT GGGGGACACC CTGAGCCCCG . AGGTGGGGCG CTGAGGACTG 90060 
30 AGGCCTCCTG GGCAGTGGCG GAGGCAGGTC CCAGGGGCCC ACACAGCCGG GGATGATGGA 90120 
GAGGTGGGAG CCCTGCATCA GTGATGGGGG CAGTCTGCAG TCATGGTGGC TTCTGCTCAC 90180 
AACCACCTGC CCAGTCTTCA AAAAGCAGCC CTCCCCTCCC CTTTTCCTCC GAGGGGAGAC 90240 
CCCTGCCCCG TACCAGATGT CCCTCTTGTC GGCTGAGATT GTAGGGGAGG CCAGCCTTAC 90300 
AGGCTGGGGG CAACAGAGCC ACCCCAGAGA AGGCAGGAAG TGAAGATTCA CCCGGCCCTC 903.60 
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TGGACGCCGG GCTGCTTCTG TGCAAAGCCA CTCCAAGAGA ACAGCTAGAA. CTCAGCGTGG 90420 

CCAGTGCTCC CGGGGGCAGT GGCACCTCAG AGGGGTCTTG AGGGGCTGCC CTGGGGGTGG 90480 

GGCTGGCACA GATGCCACCT CCAAGGGTAG CAGGAACAGG TAAGGGTCAG AGGTGACTCC 90540 

r CACCAGGGCC CCAGCATCAC TTCTTTGAGC TCTGAGTTTC ACCTGGGTGT CCCCACAGCT 90600 

5 TGGCCACACA CTCCTGAGAC ACGGCCGCCC TCCTGGGGAG AGGTGCCCTG CATAGCAGGA 90660 

AGAGGCCTCT GGGCGCCTGC CCTGAGGTGG GAGAACCTCC AGGGCTGGCA GCAGCAGGTC 90720 

TGGAGAGGAA CCAAGCTTGG GAAGCTGCTG GGGGCAGGGC AGGCCTTGAG AATGGCTCTG 907 80 

TACCCCCTGG GCAGTCACTG GGCCTGGGGT GTCTGGGTGC ACACCTACTC CCCTTGCTGT 90840 

GGGGGAGGCT GGGGACTCGG GAAGCTGCTG CGGGAGGCAG GGGTGGGGCT CACCTCCACA 90900 

10 TCCTGCTTGT TGGCCAGCAC CAAGACGGGG ACACCGCACA GCGCCTCGCT GGTCACCACC 90960 

TTCTCTGGGG AGGGCAGGAG AGGCAGCGCC TCACACCCAG CATCCTGCCT CTGACTGCCC 91020 

AGGGGCCCAC AGGCGTGGAC ACTGTGACAG CCACTCCCTC TGCCCCCCCC CCGTCACCCA 91080 

CTAGGCAGGA GCACTTCTGA CCAGACACTG AGCCTGCCCC AGGCACAGAG CTGCCCAAGC 91140 

TGGACCTGCC CCCACTCACC ATCCATCCCT CCCAGAGCAG CCAGGCCGCA CTCACCAAAC 91200 

15 GCCTGCTTGG ACTCAGCCAG CCTCTCCTCG TCGGTGGAGT CAATGACGTA GATGACGCCG 91260 

TGACACTCCG CATAATACTG GGAGGAAGCA CCAGGAGTTG GGGCTCAGTC CCCACCCTGC 91320 

CAAGGGCCAG CAGAGCCAGG CCTGTGTCAT GGCCACAGT.G AGGGGCTCAC ATGAGGAAGG 91380 

GGCAAGAGGG CAGCCCCCAA CTGCAAGACC CTTCTGGGAT GCATTCTGGG GTTGCGGGGA 91440 

GATCTGGTGG AGGTGTCCCC AGACGCTGCT CCTGAGAACC TGCCGGCAAC CTTTGGCCTG 91500 

20 ATGGTGGCCA AAGGTGAAAG ACAGGGATTG GGCCAGGCGT GGTGGCTCAC ACTTATTATC 91560 

CCAACACTTT GGGAGGCAGA AGCAGGAGGA TCACCTGAGC CCACTTCACG GCCAACCTGG 91620 

GCAACACAGT GAGACTCCGT CTGTACAAAA GCTTATGGTA ATGTGCGCCT GCAGTCCTAG 91680 

CTACTCGGGA GGCTGAGGTG GGAGGATGGC TTGAGCCTGG GAGGTTGAGG CTGTAGTGAG 91740 

' CTCTGATCAC ACCACTGCAC TCCAGCCTGG GTGAGAATGA. GAGACCCTGT CTCAAAAAAA 91800 

25 AGATAGGGTT TGGGGGCTGG AGGAACCTAG ACCACAGCCT GGCCCGTTGA GGGAGTGCAC 91360 

CTGTGGGGCT CTGTGCCAGC ACCTCGCACA GGGAGGGAGT GTGGCCATGC GGATAAGACT 91920 

GACCAGCACC ATCTACGAAG CGAGCCTTCC CTGCCAGGAC AGGGCCAGAG TCACTGAGCT 91980 

CAGACCTCTG CAGCCTGGGC TGGTCAGTCC TGGGCTCGCT GGCAACACTC CTGGGCAAGA 92040 

CAGGGCACAG CCCCTGCAGC CTCAGGTACA AGTGCTGAGC CCTGGACCAG ATGAGTGCAC 92100 

30 CTCTATCTCA ATCAGAAAAA AACACAGCAA ACTCCGCGTC CACGTGGAGC AGACAACAGC 92160 

TCACATTTGC CACTTTGCCT CCAGGCTGTG CCAGCTCTCC TGTCCAGGCA TGAGTGCCCA 92220 

GAGACCTAGA ACTGGATGCT GACCAGGTAG GACAAGCTGG TGGTCAGTGT GTTAAGACAC 92280 

ACACACCCGA GAGCATGAGA AGCCAGGAGG CACAGCCCAA CTCTCCGAAA TCCTTAGGGT 92340 

GTCTGAGCAG "GGAGTACCAG ACAACCCCAT CCCAGTGCCA 'GACAAGCTTG TGCACCTGCA 92400 
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CTTCCCACAG 


AGGAGAGAAG 


CCTGTGCACC 


TGCACTTCCC 


AbAbTbbAAA 


bbAbbAbbCb 


92460 




CAAGGCCAGG 


CCCCCCCACC 


CCCAGGAACT 


TCCCACAGTG 


GAGAGGAGbb 


CCAAGGbbAG 


92520 ' 




GCGCCCTCCA GGGTTCTGCA GGTAGCGAGG CCCCCCCACC 


CCCAGGAACT 


m ttk r** tt\ 

TCTCTGGCCT 


92580 




ACAGACAGGT 


CCCACACAGA 


GGCCGCCAAC 


HCT'TP a ACICZCZ 


ACCCTGCAGT 


GTGCCGGCTG 


92640 


5 


TCTGCTGCTG 


ACACAAGGGA 


GCAGGCGGAC 


L.L. 1 AAtab 1 bo 


AGACCTCTGT. 


GGCAGGAGGG 


92700 




GCGGCTCTGT 


GGAGGCTGCA GCAAGCCCAG 




TCCACGTGGC 


TCCTGGGGCT 


927 60 




TCTGAGCAGG 


GTGGCAGAAG 


GTTCATGTGC 


AACLbbb I (-L 


TGGACCATGG 


GACCACGTGG 


92320 




CCAGAGCCAC 


CCATCACACC 


TACCAGGCAC 


AACjb 1 bb AC A 


GCCCAGCAGG 


GCCGCAGTGG 


92380 




ACGGGAGCGA 


CACCTCAGGG 


CTGAGTGCGG 


bL. A bb AC bb A 


GAGCCCCACG 


CCCCAGTGGA 


92940 • 


10 


GGCGTCACAG 


CAGTGGTCAT 


TGTGGGGTGC 


C b L AC AAbb A 


GGGGGAAGAG 


GGAGGTGTCC 


93000 




CAGCGTGGCT 


CCTGGCTGGC CAGCTGACCC 


CAbTbbAbbA 


GTCAGAGGGA 


CTGTGGGTCT 


93060 




GAGTTTTTCT 


CCCCAGCAGC 


AATGGGAGCT 


CbbbAAblbb 


AAAGTGCCAG 


CCAGCCTGAG 


93120 




AGACTAGTGT 


TACAGCAAAG 


AACCCAGGAG 


CTGAGbTbb I 


GGCACATGCC 


ACACATGTGG 


93180 




ACACCAACCC 


AGGGTCCAGC 


CCCAGGACGA 


GGCb AA T rbb 


CAATGACGCC 


CCTTTCTGTG 


93240 


15 


GTGCTGGCTC 


TGCACAAGGA 


TGCAGGATAC 


AGGAACCAGG 


GTGGGAGCAG 


GGGCCTCCCT 


93300 




TCCGGTCCCT 


CCCAGTGACC 


TAGGGGGGTC 


CCTGCAGCTG 


ATCCTCCCAG 


CTCTGAGCTC 


93360 




AGCAGGGTCA 


GGGGTCCCGG 


CCACTAGAGC 


AGCACATACT 


CAGCAGACAC 


GCTGAATGAC 


93420 




GAGCCACAGC 


TGCCTCATGG 


GCATGACTTG 


CACCTCATGT 


CTAGGAGACC 


CTGGTGGGCA 


93480 




GGAGATGGGG 


CTGCCATCCC 


ACAGCTGTCC 


CACAGCTGGG 


GACCCAGGGA 


GCCACTGGCC 


93540 


20 


CCACCACGGT 


GGTGTCTGGA 


GAAGGGCTCA 


GACTGCCAGG 


AAGTCGCACC 


CCAGCAGAAG 


93600 




TGGTAGTGAA 


TTGGGAGGGC 


ACTCAAGGAA 


GGGCTGTGCA 


GCCCCAAGAC 


CAGCAGCAAG 


93660 




GATGGGCTAC 


AGTGGCCCCC 


TTAAGTCTCC 


CTCTTCCAGT 


TTCGCCTTAA 


GAGAGGCCCT 


93720 




CAGGACCTTG 


GAGGAACCCC 


TCTCCAACGT 


GGAAGTGTGG 


GTCCACATAG 


GGCTGCAGCT 


93780 




GTGGCCAGTG 


CAGGCATCTC 


TGGCCCCACT 


GTATTCTTGC 


TTCATGTTGG 


AGAACACTGC 


93840 


25 


ACCAGCAGAT 


GGTCTCATTT. TGGTTTCTGT 


GGGACCCACT 


TTGGCTGCAA 


AGAGCCACAC 


y 39 ou 




TGCCAGGTCA CACCTGCCCA GGGCAGCCCA CACTGGGGAC 


CCACCAGGCC 


ATGGTGTGAA 


93960 




GTCCCGGCCA 


GCCTGGCCCC 


ACATGGCACA 


GCATAGCCAG 


TTCTCCTCCA 


GGGCTCCCTG 


94020 




CTGGGCCAAC 


CACAGCTCTG 


CGGATCCTGC 


TGCCTGAGTC 


GACCTCTCCT 


CTCCCGTCCT 


94080 




CCCTGCGTTC 


CTGGTGCCGA 


CCCCCAGTGT 


GCATCCTGTA 


CCTCGACCTG 


TCTCAGCATC 


94140 


30 


TGTGCCTGAG 


ACACCGGCCT 


GTGACAAGAT 


CATCATCATC 


TGTGTCACTC 


CCCAAGCATG 


94200 




CTGCGCACTG 


GACACACAGG 


CCCTGACTCA 


ACTTGTCCTG 


TCTGACTTCA 


GTGGTCCTAC 


94260 




AGGATCTATC 


AGAGATCACT 


TGGCCATGGG 


AGAAATGTCT 


TCTTGGCTAG 


AAGTCACAGC 


94320 




AGGAGGGGAC 


ACTTTGGGGG 


CGCCTAGGAA 


AGGGGAACTA 


GGATCAAAAA 


AGAGATCAGG 


94380 




ACCTGGGCAC 


TCAGCTCTAG 


AGATGGCATC AGGGCAGCCA 


AGGCACTGGG 


GACACCCCAC 


94440 
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ACCCACTGTG 


CCAGCCTAGG 


pp a ppp A fPP 

GLAGGbAbbL 


L. b Abb rvrtVjC \_ 


a c ix nczr tp TP, 








TGCTGGACTC 


AGTGCCTGGC 


PP A PPPTT i\ P* 

CbAGGLlGAG 


A Ar , P 1 7AP7A r P7A A 

AnubnbA i AA 


aPTr*P7APPPT 


TPPPPPTPTP 
1 bbbbb i b i b 


34 jOU 




GGGAAGGGGC 


ACCACACTGG 


P A rflPTiP A P A A 

GATbTLAGAA 




ppTPTPTP A A 


A AT APP A PAP 


Jfi 0 £. U 




TGCCGCTGTG AGACCCTGAG 


GAGTCTTCTG 


P'TV A TP ATTr 

GTGA 1 LA 1 GG 




PTTB aPPTSP 
b 1 1 Hribb 1 Ab 


Q >t CO ft 


5 


a aptpa appb. 
ML IbAAbbA 


AppTP ATP A C. 


GGGAGAGGCA 


pppivrfpiipp 
GLLAiLL 1Gb 


LG1LLLLALA 


rp pi mp» p> rn pi rpm rp 
1 L lbb XL ill 


Q /I *7 /l n 




p> pp a > m pf TlpT'P 

GCL A rTTL I G 


tptp p t p tp p 


TGGTCAGCAG 


CAAGGTCTlT 


G AGL L GAAAG 


PA PPP A P" rpp a 

GAGGLALULA 


Q /t o r* Pi 

y 4 ouu 




CTTTGGAGGA 


ptp P 7A PPPTP 
b 1 bb Abbb 1 b 


CCCAGGTCCC 


CACACTTTGT 


/—» mnip ITlP* P* TIP" A 

C TT GTL L TG A 


P"T*PAP'A A a p< a 

C T G AG AAAG A 


Q /I O C A 




A A P A P A-P T»P P 

AACAGALTGL 


Lb 1 bALb lb 1 


CTGACTTGGC 


CAGCGAGGTT 


GCCCTTAGGC 


TCAAACCCAA 


94920 




p p p a p p pmmm 

GCCAGGGTTT 


p 7\ a p 7a ttppp 
GAALAJ. I LLL 


AGACACTTGT 


AAGATGTTTA 


GGTTGTTAAL 


A rp A A mpTrfP A 

ATAATGTTCA 


O A Q O A. 

y 4 yyu 


10 


GGTTTCAAAA 


LA I 1 bAAAbA 


AACTAGCCCC 


AGCCCTGAAC 


p»p« A P 1 ATPPPP 

llagatllcl 


P*P* P P'P' P"T>T'P* A 

LLL ggc TTl a 


Q C A /I A 

y b U4U 




GGCATGACCA 


GI GAAb ALGb 


CCTTCTCTCA 


ctggtcacl r 


P APP 7\ TPPPP 

gaggai gllg 


P A PTP'PP'frP A 

LAL1L 1GTLA 


QCi Art 




ACAGGTTCCC 


r>m a a m t\ p a <t*p* 

CTAATALATG 


CTCTGATCTG 


ATCGCCTTGG 


P 1 a mrrirn a pmp A 

LATTTAG TGA 


TTCTTTCCCT 


y j j. 0 u 




GGAATTCTCC 


ACTGGbLLLA 


TCGCAGGGAA 


CTCCCAAGTG 


GGAAACTlll 


prrtji p»p» A P»P A C 

lTAlCACCAC 


n c "i ~) a 




TTTTGGGGCA 


ALTTL AGL J. A 


AGGGTTCAGC 


TGGGACAAAA 


pi A P'P'P'A PPP7I 

CAGGGAGCCA 


CTlGGGAACC 


O C *1 Q A 

9D^oU 


15 


T GGG AC A.GG A 


PPAP7APAPa X 

L L AG AGAb AA 


AACCCGAGGG 


ACAGAGTGGG 


TAAGGAAAGC 


mppnPA/^PA a 

TGCTGAGGAA 


Q C O /I A 




GGGCCCAAAG 


ppp* PTpTrr 
GGCAL ib I GG 


AAAGAAGTGG 


CACTGGAGGG 


CTGGGGTGGG 


GGTGGTCCTG 


95400 




GCCAGGGAGT 


CTTACCTTG1 


CCCACAAAGA 


CTGCAGCTCT 


TCCTGCCCTC 


CTAAGTCCCA 


95460 




GAACATGAGC 


pp7\ p , P , P , T"T u T , / rs 

CGAGLLT. 1 iL 


CCACATCCAC 


AGTGCCGACT 


pipi p> p» 71 pT, PPT, 

GGGGAGAGGA 


PPA A APAPPP 

GGAAACAGGC 


O C C O A 

9dd^u 




AAGGCTCATG 


ACCTTGGTCC 


TCGACACACC 


CAGTCCCAGC 


TCTCCCAGGG 


PATIPPPPPA A 

GATGGGGCAA 


Q C C O A 


20 


ACCATGCTGG 


TGCCACTCAA 


ATGAGACTTG 


AGAGGGGCCC 


GACAGGGCTG 


mppppA pppp 

T GGC C AC GGG 


O C C A A 

95540 




CCAGCTGGAC 


TGTGAATATC 


ACGGCATCCT 


CAAGGCCCCA 


AAC C L AC AGC 


CTGCTATTGA 


O C "7 A A 




GACCCTTACT 


GTTTAGGCCC 


ACGGTGGTGG 


TGATTTTGGA 


m A /"* A PrriP' A TP 

TAGALTLATL 


P PPIWPT A pm 

LLL TTGTAGT 


a r 7 c A 




TCTTGTTAAA 


TCGGGTTTTC 


GACTGCTCCA 


GGAAGGTCTG 


AGGAbAbAvjG 


P APAPPPPJ A 

LAbAbbLbAA 


y Do-iU 




ACACATCAAG 


GAGGGGCTAT 


ACTGGCTTCC 


AAA T 1 A *T>P , P"T ,r P 

AAArATLLl i 


AL I L Abb 1L1 


pmrnprprprpA a a 

b 1 1 L 1 J. 1 AAA 


QtAsnn 


25 


AGACAGAAAC 


AGAAACAGAG 


/"* A A PA pfnPWP 

CAACACTCTG 


ptmp'r-nrpp* A p'p A 

L i L i 1 LAbbn 


p.pptpptpp:t 

bbb 1 bb 1 bb 1 


PAPTATPPTP- 


? jj4u 




CCGTCTCAGG 


TGAAATTTGG 


CTTCCGTCTG 


GGTAGTGAAC 


GTGCAGCTGA 


CAGCACAAAA 


96000 




CCGAAGGGGG 


CGCCGCCAGG 


CCGTGGGAAA 


GGTGCGCGCA 


AGGGCGTGGG 


CACTCACCGT 


96060 




CTTCCCAGCA 


TTGTCCAGGC 


CCAGGATCAG 


GATGCAGTAC 


TCGTCCTTCT 


GAAACATGTA 


96120 




CTTGTACAAG 


CCCGACAGCA 


GCGTGTACAT 


CCTGCCCTGG 


GCACCCCAAC 


ATAGGTCAGT 


96180 


30 


GTGCAGCCAG 


AAAGCACCTC 


GCCTCCCCCG 


GGCTTCTCCA 


CGGTGGTCAG 


TGGCGCCCCA 


96240 




CGTGCAGCCG ACCGCTCAGG 


ACGAGAGCCT 


GGGGGCCATT 


CCCGACTCCT 


CGTCCCTCTC- 


96300 




CCACCCCGTC 


CCTCTGTAAC 


TTCTCCCAGG 


TCAGCCGCCA 


CTGTGTCCTG 


CTCACAGCAA 


96360 




TGACTGCGAC 


CTCTCCGCAT 


ACACATCGGT 


TCCGGCCCCT 


CCCCTGCTCG 


CGGGACTACC 


96420 




CAGCCGGGTG 


TTCACAGTGA 


GCTCAGCCGC 


GCTCCCGCCC 


TCCCCCGAGG 


CTTCGCTCCC 


96480 
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ACGCTTCACG CGCGCGGAAC GGGGAACACA 
CCTCGAGCGC CAGCCCCGCG CCCCACCCGG 
AGCCCCTGCA GGCGCCGGGT AGGGACGCCC • 
CCTGGGGGGA GCGGACTACA GGGCGGGTGA 
5 ■ TGGAGGCGGC GCGCGGCTGG CAGGGAACGA 
CGGCTACGTC CGCCCCAAGC CGCGGCCGCC 
GCTGGCTCCT GTCACACCCG CGTCCCACCT 
CCTCACCGCA CAGCCTGCGG CCTACCTGCG 
ACCCCCGCTG ACCTCCGCTG ACCCCGCGCT 
10 CCGGCCTCAG GGAATGAGCT . GAACCGCGTC 
GCCCCCGCGC GCCAAGCAGT TCCGGAAGCG 
GCTGGGGCGA AAAGGGGGCG CCGGGCCGCT 
GCTGCGGCTG GGAAACCGCG CGGAGGAGGT 
GCGGGGACGC GGCCCTGGCC AGACAGAAGA 
15 GAAGGCGGCG GGCGCGGCGG GTGGGCTCGG 
GGAAGGGGCT CCGGAGTGAC GCGGGACCCG 
GGAAGCGCGG AGCCTGAGCG GGGGTACCCG 
TCTGAGCCGG TGTCTCCCCG AGGGAAAGGG 
CTGTGGGGTC GGTCCCGAGA CGCGGGGCTC 
20 AGCGCCTCCC TCTCTCAGCA TGGACGAGGA 
TGCGCAGCTG CAGCAGGTGG AGCTGGCCTT 
TGACCTGCGC CAGCTGCAGG GGGACCTGAA 
GGTGTCTGTC AGGAAGAGCA GGTTGTTGGC 
GGAAGATGCT GAGTACCAGG CTTTCCGGGA 
25 AGCGGCCCGT GGGTCCGGAT CAGAGACCGT 
GGCAGGTGGG CAGGAGGAGG AAGAGGGAGA 
GAGCGCGCCC TACTACAGCT CCTGGGGCAC 
AACGGAAGAG GCGGAGGATG GCTCGGCGGG 
CAAGTCTCTG AAGCCGTGCC CGTTCTTCCT 
30 CAGGTAAAGC CCTTTGTTGT CAGATGCCAA 
GTCGGGTCAG GATCGGCCCT CCCTTTGCTT 
CGTGCTTTGA CAGCTGTGTC TGTGTTCAGG 
GAAGCATGGA AGGAGGCCAG GCGCGGTGGC 
CCGAGGCGGG CAGATCACCT GAGGTCAGGA 
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CTCGCTGCAG 


CCCCGCCTGG 


GCCACGGCAC 


96540 


GAGCAGCGAG 


CCACCGGCGC 


GCTCCCCAGG 


96600 


CATCACCCCA 


TTTCTTAAAA 


CGGGGACGGC 


96660 


GCAGCGGCGG 


GGCTGCTCCT 


GGAGTGCACC 


96720 


CTGCGAAGGA 


AGAACCTGGG 


TCGCGGCCCC 


96780 


AGGTCTGAGG 


CTCCCCGACA 


AGCAGCCAAA 


96840 


CGAGTCCTGG 


GCCGCCCCTC 


GGGCCTCGCG 


96900 


TCCGCCGCGC 


CCTCGGAGCC ' 


GCTGCTGCTG 


96960 


AACCCCGCGC 


GGCGCCTGAC 


GGGACGCGGG 


97020 


CCAGCGGCCT 


CCGCGCTCCG 


CTTCCCGGCT 


97080 


GCGGCGCTCG 


GGAGGAAGTG 


CCGATCGGCT 


97140 


CTAGCCGGTG 


AGGCCGGCGG 


GCTCTCTGTG 


97200 


GCCCGGCCGG 


GGACCAGGTG 


GCCGCGGTTT. 


97260 


GACGCCGGGC 


GGGGGGGCGC 


GGCCGGCCTG 


97320 


CGGAGGGTGA 


GGCGGCGGGG 


CGCCCCGCGG 


97380 


GCTAGCGGCG 


AGCCCACGGC 


GGCTCGGAAG 


97440 


GGCTGCGACC 


TCTGCGCTGG 


GAGCTGTGCG 


97500 


GACGTGCCCG 


TGCCCGTGCC 


CGCCCTCAGG 


97560 


AGCTGGCTTC 


TCTTCTTGCA 


GCCCTGGTCC 


97620 


GAGCCTGGAG 


TCGGCCTTGC 


AGACCTACCG 


97680 


GGGCGCCGGC 


CTGGATTCGT 


CTGAGCAGGC 


97740 


GGAGCTCATC 


GAGCTCACCG 


AGGCCAGCCT 


97300 


CGCGCTGGAC 


GAAGAGCGCC 


CGGGCCGCCA 


97860 


GGCCATCACT 


GAGGCGGTGG 


AGGCACCAGC 


97920 


TCCTAAAGCA 


GAGGCGGGGC 


CAGAATCTGC 


97980 


GGACGAGGAA 


GAGCTGAGTG 


GGACAAAGGT 


98040 


TCTGGAGTAT 


CACAACGCCA 


TGGTGGTGGG 


98100 


TGTCCGTGTG 


CTTTACCTGT 


ACCCCACTCA 


98160 


GGAGGGAAAG 


TGCCGCTTTA 


AGGAGAACTG 


98220 


CCTTAGGGGC 


GTAAGGGGCA 


CGCACACAGG 


98280 


TGCAGTTTTG 


TCTCAGCTTC 


CTGGGGCAGG 


98340 


CGTCTACGTG 


TTCCTTCTGG 


GGTGAATCAA 


98400 


TCACGCCTGT 


AATCCCAGCA 


CTTTAGGAAG 


98460 


GTTCAAGACC 


ACGCTGGTCA 


ACATGGTGAA 


98520 
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ACCCCATCTC CTTAAAAACA CAAAAATGAA 
GGCTACTCAG GAGGCTGAGG CAGGAGAATT 
AGTGGAGATG CAGCCACTGT ACTGCAGCCC 
AGTAGGTCTC CAGCCTATCG TCAGTTTTTT 

5 AGGGTCTTAC TTTGTCAACC AGGCTGGAGT 
TGGACCTCCT GGGCTCAACC GAT.CCTCCTG 
CTCACGCTAC TACACCCAGC TAATTTTTAT 
ACGGAGTCTT GTTCTGTTGC CCAGGCTGGA 
GCTCCGCCTC CCGGGTTCAC GCCATTCTCC 

10 AGGCGCCCGC CACCATGTCT GGCTAATTTT 
ATGTTAGCCA GGATGGTCTC AATCTCCTGA 
AGTGCTGGGA TGACAAGCGT GAGCCACCGC 
TCACTCTCCT CACCCAGGCT GGAGTGTAGT 
CTCCCGGGTT GAAGTAATTC TCTACCTCAG 

15 GCCACCACAC TCGGCTAATT TTTTGTATTT 
CAGGCTGGTC TTGAACTCCT GACCTCGTGA 
GATCACAGGC GTGAGCCACT GCGCCTGGCC 
CTTTTAAATT TTACTTTCAT GGAGATTTTC 
AATCACCGCC CACGGTCCAT CATCCAATGT 

20 TTTGTCTGTT CTCCCTGCCC TATTTTTTTC 
AAGTTACTGG TTTTTTCCAA TTATGAATAC 
AGGAAATTAC TTAAAAAGTA ACTTTTTGGA 
GTAACTTCTG TGTCATAATA GGTAACACAT 
■ ATGAACAATT .ATTACTGTAA TGTCTCTATT 

25 TTTTTACTTT TCTCTTAGAA GAAATATTTA 
TAGCCAATAG TTCAGGCTGA CCGTGTAACC 
GTCTTCCTTT. TTTTTTTTTT TTGAAACAGC 
GCAGTGGCAC AATCTCGGCT CACTGCAATC 
CCTCAGCCTC CCTAATAGCT GATACTACAA 

30 TATTTTTTGT AGAGACGGGA TTTCACCATA 
GATGAGAGAT CTGACACATC CCTGTGGTGC 
TTCATATACT TTCATTCATC TGTGCAACAG 
CTGACAGAGG AGGAAACAGG AGTAGAAGAA 
GGCTCCAGTT GCCCCCAGCC CTCGTTCCGT 



CCGGTCGTGG TGGCGCGCAC CTGTGGTCCT 98580 

GGTTGAACCC AGGAGGCCGA GTTTGCAGTG 98640 

GAGCAGCAGT GCAAGGCTTA TGTGGAAGAG 98700 

TTTGGTGGTT GTTTTAATTT TTTTTGAGAC 98760 

GCAGTGGCAT AGTCCTGGCT CACTGCAGCC 98820 

CCTCAGCCCC ■ CCTAGGAGCT GGGCTACAGA 98880 

ATTACTATAA TTTTTTATCT TTTTTTTGAG 98940 

GTGCAGTGGC GTGATCTCGG CTCACTGCAA 99000 

TGCCTCAGCC TCCCGAGTAG CTGGGACTAC 99060 

CTGTATTTTT AGTAGAGACG GGGTTTCACC 99120 

CCTCGTGATC CGCCCACCTT GGCCTCCCAA 99180 

GCCTGGCCTT TTT.TTTTTGG AGACAGAGTT 99240 

GGCGCAATCT CAGCTTACCG CAACCTCTGT 993 00 

CGTCCAGAGT AGCTGGCATT ACAGGCGCCC 99360 

TTAGTAGAGT CGGAGATTCA. CCATCTTGGC 99420 

TCCACCCACC TTGGCCTCCC AAAGTGCTGG 99480 

CTGTTGTTAG TTTTATTCTC TAGAGTTCAA 99540 

AAACATACCC CAAATTAGAG AGTTTAGCAT 99600 

CGTCATTTAT TAATATTTTC CCAGTCTCAT 99660 

TTTCCTGGGC CATTTTAAAG CAAATTCCAG 99720 

TTCATAGTTG CATCTCTAAT CTAACTGATT . 99780 

AGTCCAAGTC CGATGTGAGG ACAAAAAAGA 99840 

TTAATGGTAA TACCTCTTGC ATATTCAAAT 99900 

TCCCTAAGCG. CATAGCTTTA TTTTTCCTCC 99960 

CCAAGCCTTC TAGTAGGTAA TTTTCTTTTT 100020 

ATCCCTAGTT CTAGTTCTAG TTCTTTGAAT 100080- 

GTCTTGCTGC TCTGTCACCC AGGCTGGAGT 100140 

TCCGCCTCCC TGGCCCAAGC CATCCTCCCA 100200 

GTGTGCACTG CCACGCCCAG CTAATTTTTG 100260 

TTACCCAGGT CTCGAATTCC TGATCCCTTT 100320 

TCCCTCTGGA CCAGGCACTG CTCCAAGGGT 100380 

CCCTGTAGGT AGGCCCTGCA GTCACACCAT 100440 

CTGAGTGGTC CAGGGCTTCA AGGCTCAGAG 100500 

CCCCTGCTCC ACCCAGTGCT GCTTGCCATG 100560 
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TCGGCATCAG GCCTGATCTG AAAGCTTCCG GAGCATCTTA CAGACGTCCA CCTTGCCACC 100620 
ATTCAGGACT GATAAGTTCT CTTGGATTTG CGTTGGACCT TTTTTTTTTT TTTAAGATGG 100680 
AGTTTCACTG TTGTTGCCCA GGCTAGAGTA CAATGGCACG ACCTCCACCT CCTGGGTTCA 100740 
AGGGATTCTC. CTGCCTCAGC CTCCCAAGTA GCTGGGATTA CAGGCGCCTG TCACCACGTG 100800 
5 GTGCCCAGCT AATTTTTATA TTTTTAGTAG AGGCAGGGTT TCACCGTGTT GGCCAGGCTG 100860 
GTCTCGAACC CTTGACCTCA GGTGATCCCG CCTTGGTTTC CCAAAGTGCT GGGATTACAG 100920 
GCATGAGCCA CCACACCCGG CCCAGGATTT CTTTATATAT TCTGGATATC ATCCCTTATG- 100980 
AAGTATATAG TTTGCAGATA TTTGCTCCCA TTGTTTGGGT TGTCTTTTCA CTTGATATAG 101040 
TGTCCTTTGA TGCACAAACA TTTTAAATTT TGATGCAGTG CAATTTATTG TTTCTTTATT 101100 
10 GCCTATGTTT TTGTCATCAG GTTTAAGAAA CCACCTCATC CATAGTTATG AGGATTTTCA 101160 
CCTATGTTTT CTTCTAAGAG TTCTGTAGTT TTAGCTGTTA AATTTAGGTC TTTGATCCAT 101220 
TTTGAGTTAA TTTTTGTATA TGTTATTAGG TGAGGGTCCA CTTTATTCTT TTGCATGTGG 101280 
ATTTCCAGTT TTCCCAGCAC CATTTGTTTA AAAGACTGCT TTTTCTCCAC TGAATGGTCT 101340 
TGGCACTTTT GTCCAAAATC AATTGGCAAT ATATGTAAGG GTTTATTTCT GAGCTCTCTC 101400 
15 TCCTGTTCCA TTGGTGTATA TGTGCCAGTA CCACACTGTT CTGATTATTA TAGCTTTGTG 101460 
ATAAGTTTTA AACTCAGGAA GTGGTAGTTA TTCACCATTT GCTCCTCTTT TTCAAGTTTG 101520 
TTTTGTTTCT GGATCCTTTG CAATTTCATA TGAATTTTAG GATCGGCTTG TCCAATTCTG 101580 
CATAAAAGAC AGTTTGAATT TTGATATGGA TTGCATAGAA TGTGTAGATC TGTTTGGGGC 101640 
ACATTGTCAT CTTTACAATA TTAAGCCTTC TGGCTGGGTG TGGTGGCTGA CGCCTGTAAT 101700 
20 CCCAGTACTT TGGGAGGCTG AGGCGGGCAT ATCACTTGAG GTCAGGAGTT CAAGACCAGC 101760 
CTGGCCAACG TGGTGAAACC CCGTCTCTAC TAAAAATAAA AAACAAATTA GTCGGAGGTG 101820 
GTGCACACCT GTAATCCCAG CTACAGGAGA GGGTGAGGCA GGAGAATCGC TTGAACCTGG 101880 
GAGGAGGAGG TTGCAGTGAG CTGAGATCAT GCCACTGCAC TCCAGCCTGG GTAACAGAGG 101940 
GAGACTCCAT CTTAAACAAC AACAATAACA GAAGAAAAAA ACAGTATTAA GTCTTCCAAT 102000 
25 TCATGAATGA AGGATCTGTC CATTTATTTA CGTCTTTAAT TTCTTTCAAC AGTATTTTGT 102060 
ACTGTTCAAG TCTTGCACAT TCTTGGTTAA ATAAGTATTA TTTTTGATGC TTCTCTAAGG 102120 
AATTGTTTTT CTTTTCCTTT TTTTTTTTGA GACAGAGTCT TGCTCTGTCA CCCAGGCTGG 102180 
AGTGCAGTGG CACAATCTTG GCTCACTGCA ACCTCTGCCT CCCGGGTTCA AGCAATTCTT 102240 
CTGCTCAGCC TCCCAAGTAG CTGGGATCAC AGGTGCCTGC CACCACACCC AGCTAATTTT 102300 
30 TTTTTTTGAG ATGGAGTCTT GCTCTGTTGC CCAGGCTGGA GTGAAGTGGC CCAATCTTGG 1023 60 
CTCACTGCAA GCTCCACCTC CCGGGTTCAC ACCATTCTTC CGCCTCAGCC TCCTGAGTCG 102420 
CTGGGAATAC AGGTGCCTGC CACCACGCCC AGCTAATTTT TTGTATTTTT- AGTAGAGATG 102480 
GGGTTTCACC ATGTAGCCAG GATGGTCTCG AACTCTTGAC CTCAGGTGAT CTGCCTGCCT 102540 
CGGCCTCCCA AAGTGCTGGG ATTACAGATG TGAGCCACTG TGCCCGGCTC GAGTTGTTTT 102600 
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CCTTAGTTAC ATTTTCAGGC TGTTTGTTGC 
GGCTCACGCC TGTAATCCCA GCACTTTGGG 
GGAGTTCGAG ACCAGCCTGG CCAACATGGT 
TTAGTCTGGC ATGGTGGCAG GTGCCTGTAA 
5 AATTGCTTGA ACCTGGGAGG CGGAGGTTGC 
GCTTGGGGAA CAAGAGTGAG ACTTCATCTC 
TTTTTATGTT AATCCTGTAT TGATTGCTGA 
ATGCACTATT TAGGATTTTC GATATATACA 
TATATGTGAA TAGAGATAAT TGTAGTCTTT 
10 . TTCTTGCTTA ACTGCCTTAG CTAGAACTTC 
CGGGCCGGGG GTGGTGGCTC ACACCTGTGT 
GATCACTTGA GATCAGCAGT TTGAGACCAG 
CTAAAAATAC AAAAATTAGC TGGGTGAGGT 
AGGGTGAGAC ATGAGAATTG CTTGAACCTG 
15 TGCCACTCCA CTCCAGCCTG GACGACAGAG 
AAAAGTGGCT AGAACAAACA TCTTTATCTT 
TTCCTGATAT TAGGTGGAAA ACTTTTAGTC 
GTTTTCTGTA GATTCCCTTT ATCGAGTTGA 
GTGTTTTTTA TCATGAAAGG GTGTTGATTT 
20 CCCAGGCTGG AGGGCAGTGG CATGATCATG 
GGGGATCCTC CTACTTCATC CTCCTGAGTA 
CAGCTAATTT TTTAATTTTT CTGTAGAGGT 
TAAACTCCAG GGCTCAAGCA ATCCTGCCTC 
GAGTCACTGC ACTGCACCCA GCTGTGTGGG 
25 TCATGTGTGG TTTTTTTCCT TTCATTTTGT 
ACCATCCTTG AATTCCTCAG ATAAAGCACG 
TATTTTTTTT GTAGAGATGA GATTTCACTC 
CTAAAGTGAT CCTGCTGCCT CAGCCTCCGA 
GGCCCTATCT TTTTTCTTTT TCTTTTTTTT 
30 CCGGGCTGGA GTGAGTGGCG CGATCTCGGC 
CAATTCTCCT GCCTCAGCCT CCTGAGTAGC 
GCTAATTTTT TGTGTTTTTA GTTGAGACGG 
CACTCCTGAC CTCGTGATTC ACCCACCTTG 
AGCCACCGCA GCGAGCCTTA. TCTTTTTAAC 



TAGTATATAG AAATACAAGC TGGGCACCGT 102660 
AGGCCAAGGC GGGTGGATCA CCTGTGGTCA 102720 
GAAATCCAGC CTCTATTAAA AATACAAAAA 1027 80 
TCCCATCTAC TCAGGAGGCT GAGGCAAGAG 102840 
AGTGAGCTGA GATCGCGCCA TTGCACTCCA 102900 
AAAAAAAAAA AAAAAGAAAT ACAGTGGATT 102960 
ATTGGTTTAT TAGTGCTAAT AGGATTTTTT 103020 
ATCATATATA TTCAATATAT ACAATTAATA 103080 
GTTTCTAGTT TGCATGGCAT TTATTTCTTT 103140 
AAGTACGATG TTGAATAAAA GTGACTAGAG 103200 
TCCCAGCACT TTGGGAGGTG GAAGTGGGCA 103260 
CCTGGCCAAC ACGGCGAAAC CCCATCTCTA 103320 
GATGTGCACC TGTAGTCCCA GCTACTTGAG 1033 80 
GGGGGCGGAG GTTGCAGTGA GCCAAGATCA 103440 
CAAGAACCCT GTCTTTAAAA AAAAAAAAAA 103500 
GTTCCTGATC TTAGGTGGAA AACTTTTTTG 103560 
TTTCACTGTT GAATATGATG TTACTTGTAG 103620 
GGAAATTCTC TTATATTCAT AGTGTGTTGA 103680 
TTTTTTTAAA GATAGGGTCT TGTTCTGTCA 103740 
GCTCACTGCA ACCTCGAATT CCTGGGCTCA 103800 
GGTGAGACTA CAGGCATGAG CCACCATGCC 1038 60 
AGGGTCCTGC TTTGCTGCCC AGGCTGGTCT 103920 
AGCCTCCCAA AGTGCTGAGA TTACAGGGGT 103980 
ATTTTTCAAA TGCTTTTTTC CTTTAGATGA 104040 
TAATGTGGTA TATTGATTTT CGTATGTTGA 104100 
CATATTCATG GCGTATTATC TCTTTATTAT 104160 
TGTTGCCCAA GCTGGTCTCA AACTCCTGGG 104220 
AAGCGCTGGG ATTATAGGCA TGAGCCACTT 1042 80 
TTTTTTTTGA GACAGAGTCT CACTCTGTCG 104340 
TCACTGCAAC CTCCATCTCC CGGGTTCAAG 104400 
TGGGACTACA GGTGCCCGCC ACTATGCCCA 104460 
TGTTTTGCCA TGTTGGACAG GCTGGTCTTG 104520 
GCCTCCCGAA GTGCTGGGAT TACAGGCATG 104580 
AGTTAAAAGT TTAAGGCCTT ATCATGTAAT 104640 
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AACATTGCTG GATTTGATTT GCTGCTGTTT 
GGGATATTGG TCTGTAGTTT TCTTTTCTTG 
TAATATTGGC CTCATAGAAT GAGTTAGGAA 
AGGGATTGGT GTTAATTCTT CTTCAAATGT 
5 ACAATCATAT ATATAGAGAG AGAGAGAGAG 
ATTGACTATT GATTCAATTT CCTTATTGAA 
ATAACTGTAG TGAAAGTTTC TGAACTTTTC 
TTTATTTGTT ATAGGTCTAT TCAGATTTTC 
CTGTCGCCCA GGGTGGAGTG CAGTGGTGCC 
10 AGTTCAAGTG ATTCTCCCAC CTCAGCCTCC 
CATACCTGGC TAATTTTTGT ATTTTTAGTA 
GGTCTCGAAC TCCTGACCTC AGGTGATCCA 
ACAGACATAA GCCACCGCGT CCAGCCTTGA 
TAGGAATTTG TCCATTTTGT TTATGTTATC 
15 TCCTTTTATA GCCCTAGTTA TTTCTGTAAG 
GGTTTTAGCA ATTTGAGTCA TCTCTTTTCT 
CACTGTGTCA CCCAGGCTGG AGTGCAGTGG 
CCCAGGTTCA AGCAATTCTG CCTTAGCCTC 
CCACACCCAG CTAGTTTTGT TTTGTTTTTT 
20 CGAGGCTGGA GTGCAGTGGT GCAATCTCAC 
CAATTCTCCT GCCTCAGCCT CCCGAGTAGC 
GCTGATTTTT ATATTTTTAG TAGAGATGGG 
ACTCCCTACC TGAGGTGATC CGCCCACCTT 
GAGCCACCAT GCCCAGCCAG TTTTTGTATT 
25 CCAGGCTGGT CTTGAAATCC TGACCTCAGG 
TAGGATTACA GGCATGAGCC ACCACGCATG 
AAAGGTTTGT CAATTTTGTT GATCTTTTTT 
TTTCATTTAT TTCCATTTTA ACCTTTGTTT 
CTCTTTTTTT CCCCTAATTT TTCAAGGTAT 
30 TTTCTTTTCT TTTTTTTTTT TTTTTTTTTT 
TGTCACCCAG ACTGGAGTGC AGTGGCATGA 
GCGATTCTCC TGCCTCAGCC TCCTGAGTAG 
ATGTAAGCAT. TTACAGCTAC AGATTTCCCT 
ATTGTTTTTT GTTGTTTTTT GTTGTTGTTT 



TGTTGAGAAT ATTTGCATCT GTATTGATAA 104700 
GCATGTCTTT GTATAGCTTT GATGCCAGCA 104760 
GTATTCTTTA TATTATGGGA AGAGGTAAAA 104820 
TTGATAGAAT TCAACAGTGA AGTGATATAT .104880 
AGAGATGGAC TTTTCTTTTG TTGGAAGTTT 104940 
ATTGACTTTT CTTTTTGGAA GCTAAAATGT 105000 
TTTCATTGGA AGTTTTTTGA CTACTGATTC 10$ 060 
TGTTTCTTCT TGAGTCAGTT TGGTCTCGCT 105120 
ATCTTGGCTC ACTGCAACTT CTACCTCCCG 105180 
CCAGTATCTC GGACTACAGG CGCACGCCAG 105240 
GGAACAGCAT TTCACCATGT TGGCCAGGCT 105300 
CCCGCCTCGG CCTCACAAAG TGCTGGGACT 105360 
GTCAGTTTAG ATAGTTTGCA TGCATGTTTC 105420 
TAATCTGTTA CCATACAATT GTTCATAGTA 105480 
ATCAGTAGTA ATAGCTCCAC TTTCTCTCTT 105540 
TCTTCTTTTT TTTTTTTTGA GATGGAGTCT 105600 
CATGATCTTG GCTCACTGCA ACCCCTGCCT 105660 
CTGAGTAGCT GGGATTACAG GTGTGAGCCA 105720 
TGTTTTTGAG ACGGAGTCTG TTTCTGTCTC 105780 
TCATTGCAAC CTCCGACTCC CAGATTCCAG 105840 
TGGAACTATA GGCGTGCACC ACCACGCCTG 105900 
ATTTCACCAT. GTTGGCCAGG CTGGTCTTGG 105960 
GGCCTCCCAA AGTGCTGGGA TTATAGGCAT 106020 
TTTAGTAGAG ATGGGGTTTC TCCCTGTCGG 106080 
■TTATCCACCA GCCTTGGCCT CCCAAAGTGC 106140 
GCCTGTCTTT TCTTCTTGGT CATTTTCGCT 1062 00 
GTTGCTGATC TCTATTGTTT TCCCATTCTG 1062 60 
CCTTTTTTCT GCTGGTTTGG GTTTAATTTG 106320 
ACAGTTAAGT TATTGATTTG AGATCTCTTT 106380 
TTTGGTTGCT GTTGAGATGG AGTCTCCCTC 1064 40 
TCTCAGCTCA CTGCAGCCTC CGCCGCCCAG 1065.00 
ACGTTTCCCG GCCAAGGTGT TTCTTTTTGA 106560 
CTAAACACTG CTTTCACTGC ATTCCATAAG 106620 
TGTTGTTTGA GACACAGTCT CACTCTGTTG 106680 
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CCGTTTGGAG AGCAGCGATG CGATCATAGC 
GTCCTCCTGC CTCAGCCTCC CAAGTAGCTG 
TAATTTCTTT TATAAGTTTT TGCAGAGGCC 
CACTTTGGGA GGCCAAGGTG GGTGGATCAC 
5 CGACAGGGAG AAACCCCATC TCTACTAAAA 
TGCCTGTAAT CCCAGCTACT CAGGAGGCTG 
AGAGGTTGCA GTGAGCCAGG ATCACACCAT 
CTCCATCTCA AGAAAAGAAA AAAAAAAGTT 
CAGGCTGGTC TCAAACTCCT GACTTGAAGG 
10 . GAGATTATGG GCAAGAGCCA CCGCACCCTG 
CATTTTCATT GATCTCAAGA CATCCTAATC 
TTATTCAAGA GTGTCTTTAT TTCTGCATAT 
TTTTTTTTTT GAGAAAGGGT CTTGCTCTGT 
CTTGCCTCAC TGCAACCTCT GCCTCCCGGG 
15 AGTAGCTGGG ATTACAGGCA CACACCACCA 
ACGTGCTGGT CAGACTGGTC TCGAATTCCT 
CCAAAGCACT GGGATTACAG GCGTGAAACA 
ATAGAGAGAA GGTCTCACTC AAGCCCAGGC 
TCCCTCCTCG GCCTCCCAAG GTGCTGAGAT 
20 TTTCAAGACT TTAAAAATGC CATCTTGGCT 
CACTTTGGGA GGCCGAGGTG GGCAGATCAC 
AC ATGGTGAA ACCCTGTCTC TACTAAAAAT 
GCCTGTAGTC CCAGCTACTC GGGAAGCTGA 
GAGCTTGCAG TGAGCTGAGA TCACACCACT 
25 CCGTCTCAAA AAAAAAAAAA AAAATGTCAT 
TGAGAAATTG GCTGTTAATC TTATTGAGGA 
TTGCTGTTTT AGGAGATTCT CTATCTTTGG 
GTGGATCCCT CAATTTATAA GCTACTTGGA 
ATGTCTTTCA TTAAATTTGC AAAGTTTCAG 
30 TTTGTTTCTT TCTGTCTGTT TGCCGCTTAT 
TGGTGTAGCA CAGGTCCCTT GGGCTCTAGG 
TCCTCATTTT GGATAAATTC AGCTGACCTG 
TTCTCAAATC TGCTGTTGAA ACTTCTGGTG 
GCTCCAAAGT TTCTATTTGG TTTCTTTCTG 



TCTGTAGCCT TGAGCTCCTG GACTCAATCA 106740 
GGACTACAGG TGTACACCAC TGCACCTAAC 106800 
AGGCACAGTG GCTCACACCT GTAATCCCAG 106860 
CTAAGGTCAG GAGTTCGAGA CCAGCCTGGC 106920 
ATACAAAAAT TAGCTGGGCG TGGTGGCAGG 106980 
AGGCAGGAGA ATCGCTTGAA CCTGGGAGGC 107040 
TGCACTCCAG CCTGGGTAAC AAAAGCAAAA 107100 
TTTGCAGAGA CAGGGTATCA CTTTGTTGCC 107160 
AGTCCTACTG CCTCAGCCTC CCAAAGTGCT 107220 
CCACTTGGCT GTTTTGTTCT GTTGTATTTC 107280 
TCCCTTTTGT TTTTTTGTTC GACTTACTGG 107340 
TTGTAAATTT TCCAAAAAAG TTTTTCTTTC 107400 
CGCCCAGGCT GGAGAATGGT GGTGCACAAT 107460 
TTCAAGTGAT CCTCCCACCT CAGCCTTCCC 107520 
CACCTGGCTA ATTTTTGTAT TTTAGTCTTA 107580 
GACCTCAGGT GATCTGCCCG CCTTGGCCTC. 107640 
CCATGCCCAG CCCCCAATTT TTTTTTTTTA 107700 
TGGTCTTGAA CTCCTGAGCT CAAGCTGTCA 1077 60 
TACAGGTGTG AGTCACAGTA CCTGGCCTTC 107820 
GGGCACGGTG GCTCACGCCT GTAATCCCAG 107880 
GAGGTCAGGA GATCAAGACC ACCCTGGCTA 107 940- 
ACAAAAAATT AACCAGGTGT GGTGGCAGGT 108000 
AGCAGGAGAA TGGCGTGAAC CCGGGAGGTG 108060 
GTACTCCAGC CTGGGCAACA GTGCGAGACT 108120 
CTCACTGCCT TCTGGTCCAA TAGTTTCTGA 108180 
ACATTTATAT ATTGACTAGT CACTTGTCTC 108240 
GTTTCAGCAG TTTGATTATA ATGTATCAGT 1083 00 
GTTCATTGGA CTTCTTGGAT GTGTAAATTC 1083 60 
CTACTATTCT TTGCATCTTG AAATACTAGT 108420 
GGAACTTTAT GCATACATTG ATGTGCTTCA 108480 
CATTTTTCTT TGTTCTTTTT TTCTTTCTGC 108540 
TCCTCAAGTT CACTGTTTCT TTCTTCTTCC 108600 
AAATTTTCAC TACAGTTACT GTACTTTTTA 108660 
TAGTAATTAT CACTTTACTA GTATTCTCTA 108720 
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TTTGGTTAGA CATGGTTCTT TTGTTTTCCT 
TTAAATTTCT TTTTATTTAG TTATTAATTT 
GTCACCCAGG CTGGCAGGCA ACGTCACAAT 
TTCAAGTGAT TCTCCTGCCT CAGCCTCCCA 
5 CACCCACCTA ATTTTTGGTA TTTTTAGTAG 
GTCTTAAACT ACTAACCTCA GGTGATCTGT 
CAGATGTGAG CCACTGTGCC CAGCCTCTTT 
GTTTTTGTCT AGTCATTCAA ATGTCTAGGC 
TATTTTTAAA AAATTTTTTT TAATTTTCTT 
10 CTAGGCTGGA GTGCAATGGC TTGATCTTGG 
GCGATTCTCC TGCTTCAGCC TCCTGAGTAG 
GGCTAATTTT TTGTATTTTC AGTAGAGACA' 
GATCTCGTGA CCTCATGATC CTCCTGCCTC 
GAGCCACCGC GCCCAGCCTA TTTTTTATTT 
15 GCTGTAGTGC AGTGGCACAA CCCTGGCACA 
GTCTCCCACC TTAGTCTCCT GAGTAGCTAG 
GGTTGTGTTG TTACTGTTTT AGACACAGGG 
CTCCTGGGCT CAAGCAGTCA TCCCACCTTG 
AGCCACCGCA CCCGGCCTGT TAATTTCTTT 
20 TTCTTTGGAT GCCTTGTAAT TTTTTGTTGA 
TTACTTTGAA AATCAGATCC TCCGCCCTCT 
TTGTCGTTTC TCGTTTGTTT AGTTACTTTC 
TCAGGGGTGC TTGTTTCTGT TCTTTTAGGT 
TCTTTAAATA TCTAGTGGCA AAAAGGATAA 
25 ATGCTAGGAC TTTGGGAAGT GGAGGCGGGT 
GCCTGGCCAG TATGGTGAAA CCCTGTCTCT 
TGGCACCTGC CTGTAGTCCC AGCTACTGGG 
AGGGGGCGGA GGTTGCAGTG AGCTGAGATT 
GCGAGACTCT GTCTCAAATA AAAAAAAAAA 
30 CCAGGTTGCC TCTGTACTGG GGCAAGTCCT 
CCTCAGCCTT TACTTCTCGC TCCCATGGAG 
GGCCTCCTCA GGTGTTTCTG AGCCCCTGTC 
TTTCCAAGCA TGAACAGGAG CTTTCCAAAG 
GCCTCTTCCT TCCTAGGCTT TTCTGTCAGC 



TTAGTTCATT ATCCATGGTT TCCTTTATTT 108780 
TTTTTTTTTT TGAAGCGGGG TTTCACTCTT 108840 
CTTGGCTCAC TACAACCTCC GCCTCCTGGG 108900 
AGTAGCTGGG ATTATAGGCA TGTGCCACCA 108960 
AAACTGGGTT TCACCACATT GGCCAGACTG 109020 
CCGCCTCAGC CTCCCAAAAT GCTGGGAT.TA 109080 
TTTTAGTGTA TTTAAGGTAA TTGATTGAAA 109140 
TTCCTCAGGA ACAGTTTCTA TTAATTTCTT 109200 
TTTTTTTTAG ATGGAGTCTC ACTCTATAGC 109260 
CTCACTGGAA CCTCTGCCTC CTGGGTTCAA 109320 
CTGGGACTAT AGGTGCGTGC CACCACTCCT 109380 
TGGTTTTGCC GTGTTAGCCA GGATGGTCTC 109440 
GGCCTCCCAA AGTGCTGGAA TTACAGGTGT 109500 
TTTGAGACAA AGTCTCCCTC TCTCACCCAG 109560 
CTGCAGGCTT AACCGTCCAG GCTTAAGTGA 109620 
AACTACAAGC ATGTGCCACC ATGCCTGGCT 109680 
TCTTGCTACA TTTCTCTGAC TGGTCTTGAA 109740 
GCCTCCCAAG GTGTTGAGAT TACAGGTGTG 109800 
ATTTCCGGTG AATGGGCCAC ACTTTCTTGT 109860 
AACCTGCACA ATTTGAAGAT GATAATGTGG 109920 
GCAGGGTTCA TTGTTGCTGT TTGTTGTGGA 109980 
CTGACCTTTT TAAATAAAGA CTATATTCTG 110040 
TAGTGGTTAG CTTGTGCTTT GAAAGAGATT 110100 
AGAGGCCGGG CGCAGTGGCT CACGCCTGTA 110160 
GGATCACTTG AGGTCAGGAG TTTAAGATCA 110220 
ACTAAAAATA CAAAAATTAA CCGGGCATGG 110280 
AAGACTGAGG CAGGAGAATC GCTTCAATCC 110340 
GCGCCATTGC ACTCCAGCCT GGGCAACAGA 110400 
AAAAAGGATA AAGAGTGTCT TCCATCCTTT 110460 
TCAGTGTCCG CCAGGCTGTT CACGGCTTTT 110520 
CCTAAGGATG AACCAGAGGT GAAAGTTGAG 110580 
TAGCCCCAGC TGTGTGCATG GCCTTCTGGA 110640 
CCCTTAGACC TTCATGTAGC TGTTTTCCCA 110700 
TCTTTGCCCA TCTGTTGTTG TCCCTCCCCC 1107 60 
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ACAACTTCAG GTAGTATCTA CCTGTAAATG CCTTCAGGCC AGGCGCGGTG GCTCATACCT 110820 
GTTATCCCAG CACTTTGGGA GGCCGAGGCG GGTGAATTGC TTGAGGTCAG GAGTTCGAGA 110880 
CCAGCCTGGC CAACATGGTG AAGCCCCGTC TCTAGTAAAA ATACAAAAAT TAGCTGGGCG 110940 
TGGTGGGTGC CTGTAATCTC AGCTACTCGG GAGGCTGAAG CAGGAGAATT GCTTGAGCCT 111000 

5 GGGAGGCGGA GGTTGCAGTG AGCTGAGATC GTGCCATTGC ACTCCAGCCT GGGCGACAGA 111060 
GTGAGACTCC ATCTCGGGGA AAAAAAAAAA AAAAAAATGC CATCAACAGC ACGACCCTGG 111120 
AGGCTGCCCC AGCCCTGAGA GAGTTCGAGG GGGTGAAACA AAGGCAAGCC CTTCAGGGAG 111180 
ACACTAGAAA GATCCAAATG CATAAGCAGG . ATTCCTTGAG AAAAGGTCTG TATCATCCCT 111240 
TCTGACACCA GCAAGCCACA TCAGAAATAC AGGTTGCCTT CCCCATGGCT ACATGTGAGC .111300 

10 TGGTAGTAGT GGCTGAGCAG AAATAGCCCA GCTGTCCTCC TGAAATTTAG CAGGGTCTTA 11.1360 
CTTCATTGAG CAGTCATCTG GTTCGTAGAC ACCAGAGTTA CAGAAAAGTT TATTGGGAGG 111420 
TTTTGACAGT TTAATAGAAA AAAGTTTATT GTGACAGTTT TGACAGCTGA AT AG AAAAAA 111480 
GTTTACTGTG ACAGTTTTGA CAGCAGAATA GTTGCTTTGC TGGAGAGACG GATCTTTGGA 111540 
GCTGCCAACT CCATCATTTT GGTGATATCC AGCTCTGTTG CTGAATTTTT AGCTATGCTG 111600 

15 TTTTAAGTTA TTTTCTTAGT GGTTGCTCTA GAGATGACAA TGTGCATCTT TAACTTACCA 111660 
CAATGTACTT CAGATTATTA CTAACTTAAC ACTTAAAGTA CAGCATTTTT TTTTTTATGG 111720 
AGTTTCACTC TGTCACCCAG GCTGGAGTGC AATGGTGTGA TCTCGGCTCA CTGCAACCTC 111730 
CGCCTCCCAG GTTCACGCCA TTCTCCTGCC TCAGCCTCCT GAGTAGCTGG GACTACAGGC 111840 
ACCCCCACCA CACCCGGCTA ATTTTGTATT TTTAGTAGAG ATGAGGTTTC ACCATGTTGG 111900 

20 TCAGGCTGGT CTCGAACTGC TGACCTCAGG TGATCCGCCC ATCTTGGCCT CCCAAAGTGC 111960 
TGGGATTACA GGTGTGAGCG ACTGCACTGA GCCTAAGTAT GGCAACGTGT CTATAACATA 112020 
GATCTACTTC CGTTGTACTA TGACATAGTT CCCCCTCCAT TTTCCTATAG CACAGTCCCA 112080 
ACCTCCCTTT TCCTCTGACA TAGTTCCATC : CTCCCTCCTC CTATGACGTC CTCCCTTCTC 112140 
CTCTGGCATA GCTCCATCCT CCCTTCTCCT ATGACACAGC TCCATCCTCC CTTCTCCTCT 112200 

25 GACACAGCTC CATCCTCCCT TCTCCTATGA CACAGCTCCA TCCTCCCTTC TCCTCTGACA 112260 
TAGCTCCATC CTCCCTTCTC CTATGTCATA GCTCCATCCT CCCTTCTCCT CTGACACAGC 112320 
TCCATCCTCC CTTCTCCTCT GGCATAGCTC CATCCTCCCT TCTCCTATGA CACAGCTCCA 112380 
TCCTCCCTTC TCCTATGACA CAGCTCCATC CTCCCTTCTC CTATGACACA GCTCCATCCT 112440 
CCCTTCTCCT ATGACACAGC TCCATCCTCC CTTCTCCTCT GGCATAGCTC CATCCTCCCT 112500 

30 TCTCCTCTGA CATAGCTCCA TCCTCCCTTC TCCTCTGACA TAGCTCCATC CTCCCTTCTC 112560 
CTCTGACATA GCTCCATCCT CCCTTCTCCT CTGACATAGC TCCATCCTCC CTTCTCCTCT 112 620 
GACATAGCTC CATCCTCCCT TCTCCTCTGA CATAGTTCCA TCCTCCCTTG TCCTCTGACA 112680 
TAGCTCCATC CTCCCTTCTC CTCTGACATA GCTCCATCCC CTCTTCTCCT TCATGTATTA 112740 
TTGCCATATA TACATTTATG TATGTTATAA CTTCAGCTCT ■ TCAGGGTTAT AATTATTGCT 112300 
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TCAAAAGTAT TTTGAAAGAA GTTGCCTGGA GGCAGTGGCT TATGCCTTTA ACTCCAGC AC 112860 
TTTTGGGGGC TGAGGTGGGC. AGATCGCCTG AGCCAGGGAG TTGGAGACCA GCCTGGGCAA 11292 0 
CATGACGAAA CCCATCTCCA CCAAAATTAC AAAAAATTAG TCTGGCATGG TGGCACGCGC 112980 
CTGTAGTCCC AGCTATTTGG GGGAGGATCC CAGCTAAGGT GGGAGGATCA CTTGAGCCTG 113040 

5 ' GGAAGTCAAG GCTGGAGTGA GCTGAGATTG TGCCACTGCA CTCCAGCCTG GGTGCAGATC 113100 
TTATCTCAGA AGTAAAGGGA CTAGGAATGG TGGCTTTTAT CTCTAATCCC AGCACTTTGG 113160 
GAGGCTGAGG TGAGTGGATC ACCGGAGGTC AGGAGTTTAA GACCAGCCTG GGCAACATGG 11322 0 
TGAAACCCCG TCTCTACTAA AAATACAAAA AGTAGCCGGG TGTGGTGGTG GGTGTCTGTA 113280 
ATCCCAGCTA CTCGGGAGGC TGAGGCAAGA GAATCGCTTG AACCTGGGAA GCGGAGGTTG 113340 

10 CAGTGAGCAA GATCGCACCA CTGCATTACA GCCTAGATGA CAGAGCGAGA CTCTGCCTAA 113400 
AAAAAAAAAA AAAAAGAAAA GAAAAGAAAT TAAGATCTAG ACACTGTGGT TCATGCCTGT 113460 
AATCCCAAAG CCTTGGGAGG CCAAGGCAGG AGGATCACTT GAGGCCAGGA GTTGAACACC 113520 
AGCCTGGGCA ACATAGCGAG ACTCCATCTC TATTTAAAAA AGAAAGAAAT TCAAAGAGAA 113580 
AAAAAGTATA CTTGTTTTTT TGTATCATCC ATATTTTACC TTTCTTTTTT TTGCCCCTTT 113640 

15 TTCTTTCCTG TGAATTTGAG TTACTGTCTA GTGTCATTTC CTTTTAGTCT GAAGAACTTC 113700 
ATTTAGAATT TTTTTTTTTT TTTGAGACAA AGTCTCACTG TGTTGCCCAG GCTGGAGTGC 113760 
AATGGTGCAG TCTCAGATCA CTGCAACCTC TGCCTCCCTG GTTAGAGTGA TTTTCCTGCC 113820 
TCAGCCTCCC AAGTAGCTGA GACTGCAGGC ACCTGCCACC ACCCCCAGCC AATTTTTTTG 113880 
GTATTTTTAG TAGAGACAGG GTTTCACTAT GTTGGCCAGG CTGGTCTCGA ATTCATGACC 113940 

20 TCATGATCTG CCTGTCCTGG CCTCCCAAAA TGCTGGGATT ACCATGAGCC ACCACGCCCA 114000 
GCCCATTTAG AATTTCTTTT TTTTTT TTTT TTTTGAGATG GGGTCTCGCT CTTGTTTCCC 114060 
AGGCTGGAGT GCAGTGGCAC GATCTCGGCT CACTGCGAGC TCCGCCTCCC GGGTTCACGC 114120 
CATTCTCCTG CCTCAGCCTC CCGAGTAGCT GGGATTACAG GCGCCTGCCA CCACGCCCAC 11418 0 
CTAATTTTTT GTATTTTTAG GAGAGATGGG GTTTCACCAT GTTAGCCAGG ATGGTCTTGA 114240 

25 TCTCCTGACC TCGTGATCCG CCCGCCTTGG CCTCCCAAAG TGCTGGGATT ACAGGCGTGA 114300 
GCCACCGCGC CCGGCTAGAA TTTCTTGTAG GACAGGCTTG CTAGCAACCA ATTCAGTGTT 114360 
TATTTGGGAA TGTCTTTATT TCAGCTTCAT TTTTTGAAGG ATAGTTTAGC TGGCTATAGA 114420 
ATTATTAATT GATCATTCTT TTCAGTGTTT AAAAGTGTCA TCATGCTACC TTCTGGGTTC 114480 
CATTGTTTCT GATGAGAAGT CATCTGTCAA ATTGTCCCTT TGTACTTGAA GAATTATCTT 114540 

30 TTTTTCTCTT GATGTTTTCA AGATTTTCTC TTTGTCTTTG GCCTTTAGTA GTTTGTGATG 114600 
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TATCTAGGTG TGGATCTCTT GGTGTGCATC GTATTTGGGC TTCAGTAAGC CTCTTAGATT 114660 

CATAGATTAA TGTTTTGTTT TGTTTTACCA AATTTGGAGA GTTTTTACTC ATCATTTCAA 114720 

CAAATTTTTT TCCTGCCCCT CTCTCATCTC CTTTTGGGAG TACCACTGCA TGTATGTTGG 114780 

TGTGCGTTCT CTA (SEQ ID NO : 3 ) ' 114793. 

5 The present invention also relates to a portion of SEQ ED NO:3 which comprises 5' . 

regulatory regions, exons, introns and 3' non-translated regions which comprise the human 
NHL gene of the present invention. Such regulatory sequence may be found within the 
various regions of this 115 kb fragment. The 5'portion of SEQ ID NO:l begins at nucleotide 
47095 of SEQ ID NO:3, the initiating ATG of human NHL is from nucleotide 48687-48689 

10 of SEQ ID NO:3, the termination TAG 1 codon is from nucleotide 84855-84857, while the 3' 
terminus of SEQ ID NO:l as disclosed herein (GCAGTGCCC) corresponds to nucleotides 
85308-85316. To this end, one preferred aspect of the invention is an isolated genomic 
fragment or fragments which comprise from about nucleotide 470000 to about nucleotide 
85500 of SEQ ID NO:3), which comprises the portion of the genomic clone encoding the 

15 mRNA transcript responsible for human NHL (see Figure 5 A-B). The genomic sequence 
encoding NHL contains 35 exons (Figure 5A). An especially preferred aspect of the 
invention^ a human genomic fragment or fragments which comprise from about nucleotide 
47095 to about nucleotide 85316 of SEQ ID NO:3. As noted in regard to SEQ ID NO:l, the 
present invention also relates to DNA vectors and recombinant hosts which comprise at least 

20 a portion of SEQ ID NO:3. Portions of the 1 15 kb genomic fragment may be housed in 

multiple vector/hosts so as to optimize handling of the DNA sequences within SEQ ID NO:3. 
Therefore, the present invention relates to the isolated genomic sequence which set forth as 
SEQ ID NO:3, a region of SEQ ID NO:3 which contains the coding and non-coding region pi 
human NHL, as well as a\s-acting sequences within SEQ ID NO:3 which effect regulation of 

25 transcription of one or more of the genes localized within this 1 15 kb human genomic . 

fragment, including regulatory regions effecting levels of NHL, M68/DcR3, SCUP and ARP. 
As noted above, this region of chromosome 20 (20ql3.3) is associated with tumor growth. 
Therefore, an aspect of this invention also comprises, as one example, the use of one or more 
regulatory regions of this 1 15 kb genomic sequence as a target to antagonize the effect of a 

30 transcriptional factor(s) which normally upregulate expression of a gene which has a caustic 
role in tumor growth. Alternatively, compounds may be selected which interacts with a 
specific cis-acting sequence to upregulate a gene within this region, where upregulation 
results in a decrease in tumor growth. 

The present invention is also directed to methods of screening for compounds 
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which modulate the expression of DNA or RNA encoding a NHL protein. 
Compounds which modulate these activities may be DNA, RNA, peptides, proteins, 
or non-proteinaceous organic molecules. Compounds may modulate by increasing or 
attenuating the expression of DNA or RNA encoding. NHL, or the function of the 

5 NHL-based protein. Compounds that modulate the expression of DNA or RNA 
encoding NHL or the biological function thereof may be detected by a variety of 
assays. The assay may be a simple "yes/no" assay to determine whether there is a 
change in expression or function. The assay may be made quantitative by comparing 
the expression or function of a test sample with the levels of expression or function in 

10 a standard sample. Kits containing NHL, antibodies to NHL, or modified NHL may 
be prepared by known methods for such uses. 

The DNA molecules, RNA molecules, recombinant protein and antibodies of 
the present invention may be used to screen and measure levels of.NHL. The 
recombinant proteins, DNA molecules, RNA molecules and antibodies lend 

15 themselves to the formulation of kits suitable for the detection and typing of NHL. 
Such a kit would comprise a compartmentalized carrier suitable to hold in close 
confinement at least one container. The carrier would further comprise reagents such 
as recombinant NHL or anti-NHL antibodies suitable for detecting NHL. The earner 
may also contain a means for detection such as labeled antigen or enzyme substrates 

20 or the like. 

The assays described above can be canned out with cells that have been 
transiently or stably transfected with NHL. The expression vector may be introduced 
into host cells via any one of a number of techniques including but not limited to 
transformation, transfection, protoplast fusion, and electroporation. Transfection is 

25 meant to include any method known in the art for introducing NHL into the test cells. 
For example, transfection includes calcium phosphate or calcium chloride mediated 
transfection, lipofection, infection with a retroviral construct containing NHL, and 
electroporation. The expression vector-containing cells are individually analyzed to 
determine whether they produce NHL protein. Identification of NHL expressing cells 

30 may be done by several means, including but not limited to immunological reactivity 
with anti-NHL antibodies, labeled ligand binding, the presence of host cell-associated 
NHL activity. 

The specificity of binding of compounds showing affinity for NHL is shown 
by measuring the affinity of the compounds for recombinant cells expressing NHL. 
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Expression of human NHL and screening for compounds that bind to NHL or that 
inhibit the binding of a known, radiolabeled ligand of NHL provides an effective 
method for the rapid selection, of compounds with high affinity for NHL. Such 
ligands need not necessarily be radiolabeled but can also be nonisotopic compounds 

5 that can be used to displace bound radiolabeled compounds or that can be used as 
activators in functional assays. Compounds identified by the above method are likely 
to be agonists or antagonists of NHL and may be peptides, proteins, or non- 
proteinaceous organic molecules. 

Accordingly, the present invention is directed to methods for screening for 

10 compounds which modulate the expression of DNA or RNA encoding a NHL protein 
as well as compounds which effect the function of the NHL protein. Methods for 
identifying agonists and antagonists of other receptors are well known in the art and 
can be adapted to identify agonists and antagonists of NHL. For example, 
Cascieri et al. (1992, Molec. Pharmacol. 41 : 1096-1099) describe a method for 

15 identifying substances that inhibit agonist binding to rat neurokinin receptors and thus 
are potential agonists or antagonists of neurokinin receptors. The method involves 
transfecting COS cells with expression vectors containing rat neurokinin receptors, 
allowing the transfected cells to grow for a time sufficient to allow the neurokinin 
receptors to be expressed, harvesting the transfected cells and resuspending the cells 

20 in assay buffer containing a known radioactively labeled agonist of the neurokinin 
receptors either in the presence or the absence of the substance, and then measuring 
the binding of the radioactively labeled known agonist of the neurokinin receptor to 
the neurokinin receptor. If the amount of binding of the known agonist is less in the 
presence of the substance than in the absence of the substance, then the substance is a 

25 potential agonist or antagonist of the neurokinin receptor. Where binding of the 
substance such as an agonist or antagonist to is measured, such binding can be 
measured by employing a labeled substance or agonist. The substance or agonist can 
be labeled in any convenient manner known to the art, e.g., radioactively, 
fluorescently, enzymatically. 

30 Therefore, the present invention includes assays by which modulators of NHL 

are identified. As noted above,, methods for identifying agonists and antagonists are 
known in the art and can be adapted: to identify compounds which effect in vivo levels 
of NHL. Accordingly, the present invention includes a method for determining 
whether a substance is a potential modulator pf mammalian NHL levels that 
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comprises: 

(a) providing test cells by transfecting cells with an expression 
vector that directs the expression of NHL in the cells; 

(b) exposing the test cells to the substance; 

5 ( G ) measuring the amount of binding of the substance to NHL; 

(d) comparing the amount of binding of the substance to NHL in 
the test cells with the amount of binding.of the substance to control cells that have not 
been transfected with NHL or a portion thereof; wherein if the amount of binding of 
the substance is greater in the test cells as compared to the control cells, the substance 

10 is capable of binding to NHL. 

The conditions under which step (b) of the method is practiced are conditions 
that are typically used in the art for the study of protein-ligand interactions: e.g f , 
physiological pH; salt conditions such as those represented by such commonly used 
buffers as PBS or in tissue culture media; a temperature of about 4°C to about 55°C. 

15 The assays described above can be carried out with cells that have been 

transiently or stably transfected with NHL. Transfection is meant to include any 
method known in the art for introducing NHL into the test cells. For example, 
transfection includes calcium phosphate or calcium chloride mediated transfection, 
lipofection, infection with a retroviral construct containing NHL, and electroporation. 

20 Where binding of the substance or agonist to NHL is measured, such binding 

can be measured by employing a labeled substance or agonist. The substance or 
agonist can be labeled in any convenient manner known to the art, e.g., radioactively, 
fluorescently, enzymatically. 

Therefore, the specificity of binding of compounds having affinity for NHL 

25 shown by measuring the affinity of the compounds for recombinant cells expressing 
the cloned receptor or for membranes from these cells. Expression of the cloned 
receptor and screening for compounds that bind to NHL or that inhibit the binding of a 
known, radiolabeled ligand of NHL to these cells provides an effective method for the 
rapid selection of compounds with high affinity for NHL. Such ligands need not 

30 necessarily be radiolabeled but can. also be nomsotopic compounds that can be used to 
displace bound radiolabeled compounds or that can be used as activators in functional 
assays. It is also possible to construct assays wherein compounds are tested for an 
ability to modulate helicase activity in an in vitro- or in vivo- based assay. 
Compounds identified by the above method again are likely to be agonists or 
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antagonists of NHL and may be peptides, proteins, or non-proteinaceous organic 
molecules. As noted elsewhere in this specification, compounds may modulate by 
increasing or attenuating the expression of DNA or RNA encoding NHL, or by acting 
as an agonist or antagonist of the NHL receptor protein. Again, these compounds that 

5 modulate the expression of DNA or RNA encoding NHL or the biological function 
thereof may be detected by a variety of assays. The assay may be a simple "yes/no" 
assay to determine whether there is a change in expression or function. The assay may 
be made quantitative by comparing the expression or function of a test sample with 
the levels of expression or function in a standard sample. 

10 Expression of NHL DNA may also be performed using in vitro produced 

synthetic mRNA. Synthetic mRNA can be efficiently translated in various cell-free 
systems, including but not limited to wheat germ extracts and reticulocyte extracts, as 
well as efficiently translated in cell based systems, including but not limited to 
microinjection into frog oocytes, with microinjection into frog oocytes being 

15 preferred. 

Following expression of NHL in a host cell, NHL protein may be recovered to 
provide NHL protein in active form. Several NHL protein purification procedures are 
available and suitable for use. Recombinant NHL protein may be purified from cell 
lysates and extracts by various combinations of, or individual application of salt 

20 fractionation, ion exchange chromatography, size exclusion chromatography, 
hydroxylapatite adsorption chromatography and hydrophobic interaction 
chromatography. In addition, recombinant NHL protein can be separated from other 
cellular proteins by use of an immunoaffmity column made with monoclonal or . 
polyclonal antibodies specific for full-length NHL protein, or polypeptide fragments 

25 of NHL protein. 

Polyclonal or monoclonal antibodies may be raised against NHL or a synthetic 
peptide (usually from about 9 to about 25 amino acids in length) from a portion of 
NHLdisclosed in SEQ ID NO:2. Monospecific antibodies to NHL. are purified from 
mammalian antisera containing antibodies reactive against NHL or are prepared, as 

30 monoclonal antibodies reactive with NHL using the technique of Kohler and Milstein 
(1975, Nature 256: 495-497). Monospecific antibody as used herein is defined as a 
single antibody species or multiple antibody species with homogenous binding 
characteristics for NHL. Homogenous binding as used herein refers to the ability of 
the antibody species to bind to a. specific antigen or epitope, such as those associated 
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with NHL, as described above. Human NHL-specific antibodies are raised by 
immunizing animals such as mice, rats, guinea pigs, rabbits, goats, horses and the 
like, with an appropriate concentration of NHL protein or a synthetic peptide 
generated from a portion of NHL with or without an immune adjuvant. 
5 Preimmune serum is collected prior to the first immunization. Each animal 

receives between, about 0.1 mg and about 1000 mg of NHL protein associated with an 
acceptable immune adjuvant. Such acceptable adjuvants include, but are not limited 
to, Freund's complete, Freund's incomplete, alum-precipitate, water in oil emulsion 
containing Corynebacterium parvum and tRNA. The initial immunization consists of 
10 NHL protein or peptide fragment thereof in, preferably, Freund's complete adjuvant at 
multiple sites either subcutaneously (SC), intraperitoneal^ (IP) or both. Each animal 
is bled at regular intervals, preferably weekly, to determine antibody titer. The 
animals may or may not receive booster injections following the initial immunization. 
Those animals receiving booster injections are generally given an equal amount of 
15 NHL in Freund's incomplete adjuvant by the same route. Booster injections are given 
at about three week intervals until maximal titers are obtained. At about 7 days after 
each booster immunization or about weekly after a single immunization, the animals 
are bled, the serum collected, and aliquots are stored at about -20°C. 

Monoclonal antibodies (mAb) reactive with NHL are prepared by immunizing 
20 inbred mice, preferably Baib/c, with NHL protein. The mice are immunized by the IP 
or SC route with about i mg to about 100 mg, preferably about 10 mg, of NHL 
protein in about 0.5 ml buffer or saline incorporated in an equal volume of an 
acceptable adjuvant, as discussed above. Freund's complete adjuvant is preferred. 
The mice receive an initial immunization on day 0 and are rested. for about 3 to about 
25 30 weeks. Immunized mice are gi ven one or more booster immunizations of about 1 
to about 100 mg of NHL in a buffer solution such as phosphate buffered saline by the 
intravenous (IV) route. Lymphocytes, from antibody positive mice, preferably splenic 
lymphocytes, are obtained by removing spleens from immunized mice by standard 
procedures known in the art. Hybridoma cells are produced by mixing the splenic 
30 lymphocytes with an appropriate fusion partner, preferably myeloma cells, under 

conditions which will allow the formation of stable hybridomas. Fusion partners may 
include, but are not limited to: mouse myelomas P3/NS l/Ag4-l; MPC-11; S-194 and 
Sp 2/0, with Sp 2/0 being preferred. The antibody producing cells and myeloma cells 
' are fused in polyethylene glycol, about 1000 mol. wt., at concentrations from about 
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30% to about 50%.. Fused hybridoma cells are selected by growth in hypoxanthine, 
thymidine and aminopterin supplemented Dulbecco's Modified Eagles Medium 
(DMEM) by procedures known in the art. Supernatant fluids are collected form 
■ growth positive wells on about days 14, 18, and 21 and are screened for antibody 
5 production by an immunoassay such as solid phase immunoradioassay (SPIRA) using 
NHL as the antigen. The culture fluids are also tested in the Ouchterlony precipitation 
assay to determine the isotype of the mAb. Hybridoma cells from antibody positive 
wells are cloned by a technique such as the soft agar technique of MacPherson, 1973, 
Soft Agar Techniques, in Tissue Culture Methods and Applications, Kruse and 
10 Paterson, Eds., Academic Press. 

Monoclonal antibodies are produced in vivo by injection of pristine primed 
Balb/c mice, approximately 0.5 ml per mouse, with about 2 x 10 6 to about 6 x 10 6 
hybridoma cells about 4 days after priming. Ascites fluid is collected at 
approximately 8-12 days after cell transfer and the monoclonal antibodies are purified 
15 by techniques known in the art. 

In vitro production of anti- NHL mAb is carried out by growing the hybridoma 
in DMEM containing about 2% fetal calf serum to obtain sufficient quantities of the 
specific mAb. The mAb are purified by techniques known in the art. 

Antibody titers of ascites or hybridoma culture fluids are determined by 
20 various serological or immunological assays which include, but are not limited to, 

precipitation, passive agglutination, enzyme-linked immunosorbent antibody (ELISA) 
technique and radioimmunoassay (RIA) techniques. Similar assays are used to detect 
the presence of NHL in body fluids or tissue and cell extracts. 

It is readily apparent to those skilled in the art that the above described 
25 methods for producing monospecific antibodies may be utilized to produce antibodies 
specific for NHL peptide fragments, or a respective full-length NHL. 

NHL antibody affinity columns are made, for example, by adding the 
antibodies to Affigel-10 (Biorad), a gel support which is pre-activated with N- 
hydroxysuccinimide esters such that the antibodies form covalent linkages with the 
30 agarose gel bead support. The antibodies are then coupled to the gel via amide bonds 
with the spacer arm. The remaining activated esters are then quenched with 1M 
ethanolamine HC1 (pH 8). The column is washed with water followed by 0.23 M 
glycine HC1 (pH 2.6) to remove any non-conjugated.antibody or extraneous protein. 
The column is then equilibrated in phosphate buffered saline (pH 7.3) and the cell 
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culture supematants or cell extracts containing full-length NHL or NHL protein 
fragments are slowly passed through the column. The column is then washed with 
phosphate buffered saline until the optical density (A 2 so) falls to background, then the 
protein is eluted with 0.23 M glycine-HCl (pH 2.6). The purified NHL proteinis then 

5 dialyzed against phosphate buffered saline. 

Pharmaceutical^ useful compositions comprising modulators of NHL may be 
formulated according to known methods such as by the admixture of a 
pharmaceutical^ acceptable carrier. Examples of such carriers and methods of 
formulation may be found in Remington's Pharmaceutical Sciences. To form a 

10 pharmaceutical^ acceptable composition suitable for effective administration, such 
compositions will contain an effective amount of the protein, DNA, RNA, modified 
NHL, or either NHL agonists or antagonists including tyrosine kinase activators or 
inhibitors. 

Therapeutic or diagnostic compositions of the invention are administered to an 
15 individual in amounts sufficient to treat or diagnose disorders. The effective amount 

may vary according to a variety of factors such as the individual's condition, weight, 

sex and age. Other factors include the mode of administration. 

The pharmaceutical compositions may be provided to the individual by a 

variety of routes such as subcutaneous, topical, oral and intramuscular. 
20 The term "chemical derivative" describes a molecule that contains additional 

chemical moieties which are not normally a part of the base molecule. Such moieties 

may improve the solubility, half-life, absorption, etc. of the base molecule. 

Alternatively the moieties may attenuate undesirable side effects of the base molecule 

or decrease the toxicity of the base molecule. Examples of such moieties are 
25 described in a variety of texts, such as Remington's Pharmaceutical Sciences. 

Compounds identified according to the methods disclosed herein may be used 

alone at appropriate dosages. Alternatively, co-administration or sequential 

administration of other agents may be desirable: 

The present invention also has the objective of providing suitable topical, oral, 
30 systemic and parenteral pharmaceutical formulations for use in the novel methods of 

treatment of the present invention. The compositions containing compounds 

identified according to this invention as the active ingredient can be administered in a 

wide variety of therapeutic dosage forms in conventional vehicles for administration. 

For example, the compoundis can be administered in such oral dosage forms as tablets, 
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capsules (each including tinned release and sustained release formulations), pills, 
powders, granules, elixirs, tinctures, solutions, suspensions, syrups and emulsions, or 
by injection. Likewise, they may also be administered in intravenous (both bolus and 
infusion), intraperitoneal, subcutaneous, topical with or without occlusion, or 

5 intramuscular form, all using forms well known to those of ordinary skill in the 
pharmaceutical arts.. 

Advantageously, compounds of the present invention may be administered in a 
single daily dose, or the total daily dosage may be administered in di vided doses of 
two, three or four times daily. Furthermore, compounds for the present invention can 

10 be administered in intranasal form via topical use of suitable intranasal vehicles, or 
via transdermal routes, using those forms of transdermal skin patches well known to 
those of ordinary skill in that art. To be administered in the form of a transdermal 
delivery system, the dosage administration will, of course, be continuous rather than 
intermittent throughout the dosage regimen. 

15 For combination treatment with more than one active agent, where the active 

agents are in separate dosage formulations, the active agents can be administered 
concurrently, or they each can be administered at separately staggered times. 

The dosage regimen utilizing the compounds of the present invention is 
selected in accordance with a variety of factors including type, species, age, weight, 

20 sex and medical condition of the patient; the severity of the condition to be treated; 
the route of administration; the renal, hepatic and cardiovascular function of the 
patient; and the particular compound thereof employed. A physician or veterinarian 
of ordinary skill can readily determine and prescribe the effective amount of the drug 
required to prevent, counter or arrest the progress of the condition. Optimal precision 

25 in achieving concentrations of drug within the range that yields efficacy without 
toxicity requires a regimen based on the kinetics of the drug's availability totarget 
sites. This involves a consideration of the distribution, equilibrium, and elimination 
of a drug. 

The present invention also relates to a non-human transgenic animal which is 
30 useful for studying the ability of a variety of compounds to act as modulators of NHL, 
or any alternative functional NHL in vivo by providing cells for culture, in vitro. In 
reference to the transgenic animals of this invention, reference is made to transgenes 
and genes. As used herein, a transgene is a genetic construct including a gene. The 
transgene is integrated into one or more chromosomes in the cells in an animal by 
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methods known in the art. Once integrated, the transgene is carried in at least one 
place in the chromosomes of a transgenic animal. Of course, a gene is a nucleotide 
sequence that encodes a protein,, such as one or a combination of the cDNA clones 
described herein. The gene and/or transgene may also include genetic regulatory 

5 elements and/or structural elements known in the art. A type of target cell for 

transgene introduction, is the embryonic stem cell (ES). ES cells can be obtained from 
pre-implantation embryos cultured in vitro and fused with embryos (Evans et at., 
1981, Nature 292:154-156; Bradley et al., 1984, Nature 309:255-258;* Gossler et a]., 
1986, Proc. Natl Acad. Sci. USA 83:9065-9069; and Robertson et al., 1986 Nature 

10 322:445-448). Transgenes can be efficiently introduced into the ES cells by a variety 
of standard techniques such as DNA transfection, microinjection, or by retrovirus- 
mediated transduction. The resultant transformed ES cells can thereafter be combined 
with blastocysts from a non-human animal. The introduced ES cells thereafter 
colonize the embryo and contribute to the germ line of the resulting chimeric animal 

15 (Jaenisch, 1988, Science 240: 1468-1474). It will also be within the purview of the 
skilled artisan to produce transgenic or knock-out invertebrate animals 
(e.g., C elegans) which express the NHL transgene in a wild type background as well 
in C elegans mutants knocked out for one or both of the NHL subunits. These 
organisms will be helpful in further determining the dominant negative effect of NHL 

20 as well as selecting from compounds which modulate this effect. 

The present invention also relates to a non-human transgenic animal which is 
heterozygous for a functional NHL gene native to that animal. As used herein, 
functional is used to describe a gene or protein that, when present in a cell or in vitro 
system, performs normally as if in a native or unaltered condition or environment. 

25 The animal of this aspect of the invention is useful for the study of the retinal specific 
expression or activity of NHL in an animal having only one functional copy of the 
gene. The animal is also useful for studying the ability of a variety of compounds to 
act as modulators of NHL activity or expression in vivo or, by providing cells for 
culture, in vitro. It is reiterated that as used herein, a modulator is a compound that 

30 causes, a change in the expression or activity of NHL, or causes a change in the effect 
of the interaction of NHL with its ligand(s), or other protein(s). In an embodiment of 
this aspect, the animal is used in a method for the preparation of a further animal 
which lacks a functional native NHL gene. In another embodiment, the animal of this 
aspect is used in a method to prepare an animal which expresses a non-native NHL 
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gene in the absence of the expression of a native NHL gene. In particular 
embodiments the non-human animal is a mouse. In further embodiments the non- 
native NHL is a wild-type human NHL which is disclosed herein, or any other 
biologically equivalent form of human NHL gene as also disclosed herein. 

5 In reference to the transgenic animals of this invention, reference is made to 

transgenes and genes. As used herein, a transgene is a genetic construct including a 
gene. The transgene is integrated into one or more chromosomes in the cells in an 
animal by methods known in the art. Once integrated, the transgene is carried in at 
least one place in the chromosomes of a transgenic animal. Of course, a gene is a 

10 nucleotide sequence that encodes a protein, such as human or mouse NHL. The gene 
and/or transgene may also include genetic regulatory elements and/or structural 
elements known in the art. 

Another aspect of the invention is a non-human animal embryo deficient for 
native NHL expression, This embryo is useful in studying the effects of the lack of 

15 NHL on the developing animal. In particular embodiments the animal is a mouse. 
The animal embryo is also useful as a source of cells lacking a functional native NHL 
gene. The cells are useful in in vitro culture studies in the absence of NHL. 

An aspect of this invention is a method to obtain an animal in which the cells 
lack a functional gene NHL native to the animal. The method includes providing a 

20 gene for an altered form of the NHL gene native to the animal in the form of a 

transgene and targeting the transgene into a chromosome of the animal at the place of 
the native NHL gene. The transgene can be introduced into the embryonic stem cells 
by a variety of methods known in the art, including eiectroporation, microinjection, 
and lipofection. Cells carrying the transgene can then be injected into blastocysts 

25 which are then implanted into pseudopregnant animals. In alternate embodiments, the 
transgene-targeted embryonic stem cells can be coincubated with fertilized eggs or 
morulae followed by implantation into females. After gestation, theanimals obtained 
are chimeric founder transgenic animals. The founder animals can be used in further 
embodiments to cross with wild-type animals to produce FI animals heterozygous for 

30 the altered NHL gene. In further embodiments, these heterozygous animals can be 
interbred to obtain the non-viable transgenic embryos whose somatic and germ cells 
are homozygous for the altered NHL gene and thereby lack a functional NHL gene. In 
other embodiments, the heterozygous animals can be used to produce cells lines. In 
preferred embodiments, the animals are mice. 
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A further aspect of the present invention is a transgenic non-human animal 
which expresses a non-native NHL on a native NHL null background. In particular 
embodiments, the null background is generated by producing an animal with an 
altered native NHL gene that is non-functional, i.e. a knockout. The animal can be 

5 heterozygous (i.e., having a different allelic representation of a gene, on each of a pair 
of chromosomes of a diploid genome) or homozygous (i.e., having the same 
representation of a gene on each of a pair of chromosomes of a diploid genome) for 
the altered NHL gene and can be hemizygous (i.e., having a gene represented- on only 
one of a pair of chromosomes of a diploid genome) or homozygous for the non-native 

10 NHL gene. In preferred embodiments,. the animal is a mouse. In particular 

embodiments the non-native NHL gene can be a wild-type or mutant allele including 
those mutant alleles associated with a disease. In further embodiments, the non-native 
NHL is a human NHL. In a further embodiment the non-native NHL gene is operably 
linked to a promoter. As used herein, operably linked is used to denote a functional 

15 connection between two elements whose orientation relevant to one another can vary. 
In this particular case, it is understood in the art that a promoter can be operably 
linked to the coding sequence of a gene to direct the expression of the coding 
sequence while placed at various distances from the coding sequence in a genetic 
construct. 

20 An aspect of this invention is a method of producing transgenic animals 

having a transgene including a non-native NHL gene on a native NHL null 
background. The method includes providing transgenic animals of this invention 
whose cells are heterozygous for a native gene encoding a functional NHL protein and 
an altered nati ve NHL gene. These animals are crossed with transgenic animals of 

25 ' this invention that are hemizygous for a transgene including a non-native NHL gene to 
obtain animals that are both heterozygous for an altered native NHL gene and 
hemizygous for a non-native NHL gene. The latter animals are interbred to obtain 
animals that are homozygous or hemizygous for the non-native NHL and are 
homozygous for the altered native NHL gene. In particular embodiments, cell lines 

30 are produced from any of the animals produced in the steps of the method. 

The transgenic animals and cells of this invention are useful in the 
determination of the in vivo function of a non-native NHL in the central nervous 
. system and in other tissues of an animal. The animals are also useful in studying the 
tissue and temporal specific expression patterns of a non-native NHL throughout the 
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- animals. The animals are also useful in determining the ability for various forms of 
wild-type and mutant alleles of a non-native NHL to rescue the native NHL null 
deficiency. The animals are also useful for identifying and studying the ability of a 
variety of compounds to act as modulators of the expression or activity of a non-native 
5 NHL in vivo, or by providing cells for culture, for in vitro studies. 

As used herein, a "targeted gene" or "Knockout" (KO) is a DN A. sequence 
introduced into the germline of a non-human animal by way of human intervention, 
including but not limited to, the methods described herein. The targeted genes of the 
invention include nucleic acid sequences which are designed to specifically alter 
10 cognate endogenous alleles. An altered NHL gene should not fully encode the same 
NHL as native to the host animal, and its expression product can be altered to a minor 
or great degree, or absent altogether. In cases where it is useful to express a non- 
native NHL gene in a transgenic animal in the absence of a native NHL gene we 
prefer that the altered NHL gene induce a null lethal knockout phenotype in the 
15 animal. However a more modestly modified NHL gene can also be useful and is 
within the scope of the present invention. 

A type of target cell for transgene introduction is the embryonic stem cell (ES). 
ES cells can be obtained from pre-implantation embryos cultured in vitro and fused 
with embryos (Evans et al., 1981, Nature 292: 154-156; Bradley et al., 1984, Nature 
20 309:255-258; Gossler et al., 1986, Proc. Natl. Acad, Sci. USA 83:9065-9069; and 

Robertson et al., 1986 Nature 322:445-448). Transgenes can be efficiently introduced 
into the ES ceils by a variety of standard techniques such as DNA transfection, 
microinjection, or by retrovinis-mediated transduction. The resultant transformed ES 
cells can thereafter be combined with blastocysts from a non-human animal. The 
25 introduced ES cells thereafter colonize the embryo and contribute to the germ line.of 
the resulting chimeric animal (Jaenisch, 1988, Science 240: 1468-1474). 

The methods for evaluating the targeted recombination events as well as the 
resulting knockout mice are readily available and known in the art. Such methods 
include, but are not limited to DNA (Southern) hybridization to detect the targeted 
30 allele, polymerase chain reaction (PCR), polyacrylamide gel electrophoresis (PAGE) 
and Western blots to detect DNA, RNA and protein. 

The following examples are provided to illustrate the present invention 
without, however, limiting the same hereto. 
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EXAMPLE 1 
Characterization of DNA Molecules Encoding NHL 
M68/DcR3 identification - The human osteoprotegerin (OPG) sequence (Acc. 

5 # U94332), which is a member of the TNFR-related family, was used to searched 
Genbank using the programs TBLASTN and TFASTX3 to identify novel gene family 
members. Two EST sequences (GenBank Acc. # AAI5570I and AA025672) were 
identified that showed sequence similarities to the cysteine repeats of the OPG 
sequence. These EST sequences were then used to identify additional EST sequences, 

10 which formed a single EST cluster (GenBank Acc. #s aa577603, aa603704, aa6 13366, 
aal58406, w67560, aa325843, aal55646, aa025673, aa5 14270, m91489). Two clones 
were further characterized, which were derived from colon tumor and germ cell tumor 
libraries (Research Genetics, Inc). DNA sequence analysis revealed two alternatively 
spliced forms of the 5'-end UTR of M68/DcR3. The M68/DcR3 open reading frame 

15 was confirmed by sequence analysis of clones obtained by PCR cloning from a normal 
human cDNA library (Clontech). 

M68/DcR3 BAC identification and sequencing - To further delineate the gene 
structure of M68/DcR3, genomic DNA was obtained using a human "Down to the 
Well" ™ genomic bacterial artificial chromosome (BAC) library (Genome Systems, 

20 Inc.) according to the manufacturer s protocol. Two sets of PCR primers, C68.36F: 
5'-CACAGGTTCAGCATGTTTGTGCGTC-3' (SEQ ID NO:4) and C68.275R: 
5'-CACAGTCCCTGCTGGCCTCTGTCTA-3' (SEQ ID NO:5), and E68.7 15F; 
. 5'-CAGGACATCTCCATCAAGAGGCTGC-3 (SEQ ID NO:6) and E68.972R: 
5'-AATAAGAGGGGGCCAGGATCAGTGC-3' (SEQ ID NO:7), were used to carry 

25 out PCR reactions to identify positive wells that contained the full-length M68/DcR3 
gene. The PCR conditions used were 94°C for 9min, 35 cycles of (94°C, 30 sec, 68°C 
3 min.) followed by 72°C for 10 min. Two positive BAC clones were identified and 
characterized by restriction digestion and B AC-end sequence analyses, of which 
hbml68 was selected for shotgun sequencing. 

30 A shot-gun library for BAC hbml68 was constructed using.a conventional 

strategy. Briefly, two 150-ml bacterial cultures were combined and purified using a 
modified protocol of the plasmid-Maxi kit (QIAGEN) followed by CsCl gradient 
purification. After butanol extraction and isopropanol precipitation, BAC DNA was 
nebulizied at 10 psi for 60 seconds to generate randomly sheared fragments. 
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Following ethanol precipitation, the fragments were end-repaired using T4 polymerase 
(Promega) and BstXI adaptors (Invitrogen) were Iigated overnight. Removal of 
excess, unligated adaptors and size selection was performed using a cDNA sizing 
column (Life Technologies, Inc.) to generate genomic fragments in the size range of 

5 1500 to 3000 bp. Adaptor Iigated fragments were cloned into a modified pBlueScript 
SK + vector (Stratagene) and transformed in XL2-Blue ultracompentent cells 
(Stratagene). Approximately 1000 clones were isolated, plasmids were purified using 
the Turbo miniprep kits (QIAGEN), and both plasmid ends were sequenced with the 
BigDye terminator kits (Perkin-Elmer). Sequence data were assembled using 

10 Phred/Phrap/Consed where single-stranded and gap regions were closed using a 
directed sequencing strategy. 

NHL identification and sequencing - The genomic clone for the NHL gene 
was obtained and sequenced. The transcript was identified through exon prediction 
using GRAIL2 and sequence alignment to a contiguous 4.5 kilobase region of 

15 chromosome 4 (88% sequence identity). The complete exon structure of NHL was 
subsequently confinmed by RT-PCR analysis. The exon structure was confirmed by 
RT-PCR using polyA RNA from a human colorectal adenocarcinoma cell line, 
SW480 (Clontech). Primers were designed based on the genomic sequence that were 
predicted to be exons. RT-PCR reaction were carried out with SW480 polyA RNA 

20 using standard conditions with TaqGold Enzyme at 94°C for 12min, 35 cycles of 
(94°C, 30 sec, 60C, 30 sec, and 68°C 2-6 min.) followed by 68°C for 7 min. Most 
sequence confrimation was accomplished by RT-PCR, although first junction between 
exon 1 and 2 was confirmed by 5'RACE and junctions between exon 26-29 were by 
RCCA. The primers used were as follows: 



Junction of Exons 


Confirmed bv Primers 


H01/H02 


hdkw (5'RACE) 


H02/H03 


hdiy.hdiz 


H03-H09 


hdid,hdie,hdja,hdjb 


H09-H13 - 


hdja,hdie 


H13-H18 


hdje.hdjf 


H18-H23 


hdjg.hdjh 


H23-H26 


hdji.hdjj 


H26-H29 


hdkv,r543(RCCA) 


H29-H31 


hdij,hdmu,hdnd;hdne 
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H31/H32 hdij,hdmu 

H32/H34 hdip,hdil,hdmv,hdik,hdli 

H34/H35 hdng.hdnh 

5 HDED - 5'-GTGAATGGCATCCTGGAGAG-3' (SEQ ID NO:8); 

HDCE - 5'-GTCTCCAGGCAGCTCAACAG-3' (SEQ ID N0:9); 

HDD - 5'-ACCCTGTCCCTCCTGTCTGA-3'(SEQ ID NO:10); 

HDIY - 5 - AG ACCCT A AG ATGTTC GG AG-3 ' (SEQ ID NO: 11); 

HDIZ - 5 '-G ATG ACCTGTGTG AGTTGCG-3 ' (SEQ ID NO: 12); 
10 HDJ A - 5'-CGCAACTCACACAGGTCATC-3' (SEQ ID NO: 13); 

HDJB - 5'-GGAGTCAGGTCAAAGGATGC-3' (SEQ ED NO: 14); 

HDJC - 5-GCATCCTTTGACCTGACTCC-3'(SEQ ID NO: 15); 

HDJD - 5-GGTCTGAAACGTGATCTGGG-3' (SEQ ED NO: 16); 

HDJE - 5 -CCCAGATCACGTTTCAGACC-3' (SEQ ID NO: 17); 
15 HDJF - 5'-CG ATG ATGTGTGGGTTCTCC-3 ' (SEQ ID NO:18); 

HDJG - 5'-GGAGAACCCACACATCATCG-3' (SEQ ID NO: 19); 

HDJH - 5'-CGTGTCTGAGAAGTCCAGCC-3' (SEQ ID NO:20); 

HDJI - 5-GGCTGGACTTCTC AGACACG-3 ' (SEQ ID NO:21); 

HDJJ - 5'-ACAGCATCTTCTCCACGCAC-3' (SEQ ID NO:22); 
20 HFMU - 5 -AGTCCTCTGGCTTTGC AGTG-3' (SEQ ID NO:23); 

HDKV - 5 -TGTGCGTGGAGAAGATGCTG-3' (SEQ ID NO:24); 

HDKW - 5'-GGCTGGAAAGGGAAGTCTAC-3* (SEQ ID NO:25); 

HDND - 5 -TGGTTCAGGTGCTCTTGGGG-3' (SEQ ID NO:26); 

HDNE - 5 -CGTG AAGCAGGAGTTGAGCC-3' (SEQ ID NO:27); 
25 HDIK - 5'-ATCTTGCTCTGGGTCTTCCC-3' (SEQ ID NO:28); 

HDIL - 5'-CACTGCAAAGCCAGAGGACT-3' (SEQ ID NO:29); 

HDD? - 5-ATAAGCAAGACGACGACCTC-3' (SEQ ID NO:30); 

HDLI - 5 '-CT ATTCTGTTGGGTGGGTTC-3 ' (SEQ ID NO:31); 

HDMV - 5 -CGTGCCTCCTGTGCTTACCC-3'(SEQ ID NO:32); 
30 HDNG - 5 -CAGACCCCAAGGTAGCTCAG-3' (SEQ ID NO:33); 

HDNH - 5 -GGAAGACCCAGAGCAAGATC-3' (SEQ ID NO:34). 
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Amplified product were subject to direct sequencing after purification from an agarose 
gel or cloned into a TOPO PCR cloning vector (Invitrogen) for sequencing. 
Multiple sequence alignment of NHL to known helicases showed that NHL contains 

5 all the seven critical helicase domains. BLAST analysis of the predicted. 1,2 19 amino 
acid sequence (see Figure 2, SEQ ID NO:2) reveal an approximately 26% sequence 
identity and 48% sequence similarity to the RAD3/ERCC2 gene family of DNA 
helicases (see Figure 3). Review of this sequence data shows that two partial human 
cDNA clones (Acc. No. al080127 and ab02901 1) are deposited. No. al080127 covers 

10 exon 25-35 while ab02901 1 covers exons 9-35. Ab02901 1 starts at amino acid 240 
of the full length human NHL protein disclosed herein, but also differs at exon 35 and 
appears to be a fusion transcript with M68. This cDNA was isolated from brain 
tissue, which has been known to express rare transcripts. 

15 EXAMPLE 2 

. Northern Analysis of human NHL Expression 
Messenger RNA (mRNA) obtained from human brain, heart, skeletal muscle, 
colon, thymus, spleen, kidney, liver, small intestine, placenta, lung, and peripheral 
blood leukocytes. Two ng of polyA + RNA were run on each lane a denaturing 

20 formaldehyde 1% agarose gel, and transferred to a charged-modified nylon membrane. 
The probe was made using a 733 bp fragment derived from 1 174-1907 nt of the NHL 
cDNA. This fragment was labeled via the 32 P dCTP random priming method 
(Ambion). Hybridization was carried in ExpressHyb (Clontech) according to the 
manufacturer's protocol except for the final wash, which was at 55°C. Membranes 

25 were exposed to X-ray film with intensifying screen at -80°C overnight. The Northern 
data is presented in Figure 4. Note hybridization of the NHL probe to an 
approximately 4.4 kb transcript. The 7.5 kb transcript may suggest an alternative 
splicing of the NHL RNA. 
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EXAMPLE 3 
Chromosomal localization 
To map the position of M68/NHL in the human. genome, primers C68.36F and 

5 C68;275R, were used to carry out PCR reactions to 93 clones of the MIT GeneBridge 
4 panel (Research Genetics) and results were submitted to MTT for analysis. 
M68/DcR3 was mapped to the extreme telomere of chromosome 20, at 20ql3.3, 28cR 
from D20S173 with a lod score of 13. An analogous procedure was also carried out 
with the 83 clones of the Stanford G3 radiation hybrid panel, with PCR results 

10 submitted to the Stanford Genome Center for analysis.. Analysis usinganother pair of 
PCR primers specific to NHL yielded the same result. For fluorescence in situ (FISH) 
analysis, the normal human male fibroblast cell line, L136 (Coriell Cell Repository, 
Camden, NJ) was arrested in mitosis with colcemid (10 \i^m\). A human 
chromosome 20 a-satellite probe (Vysis, Downers Grove, IL) was directly labeled 

15 with Spectrum- Orange dUTP and was used to identify chromosome 20. The M68 
BAG clone was directly labeled with SpectrumGreen dUTP by nick translation 
(Vysis). Slides were counterstained with DAPI stain and viewed under an Olympus 
microscope with narrow blue and DAPI/TRITC filters. Fifty metaphase cells were 
scored to verify that the M68 probe was located on the same chromosome as the 

20 Human Chromosome 20 probe. Radiation hybrid chromosomal mapping reconfirms 
that it is linked to M68 locus,at 20ql 3.3. 
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WHAT IS CLAIMED IS: 

T. A purified DNA molecule encoding a mammalian NHL protein. 

2. A purified DNA molecule of claim 1 encoding a human NHL protein 
5 which comprises the amino acid sequence 

MPKIVLNGVT VDFPFQPYKC QQEYMTKVLE CLQQKVNGIL ESPTGTGKTL CLLCTTLAWR 
EHLRDGISAK KIAERAQGEL FPDRALSSWG NAAAAAGDPI ACYTDIPKII YASRTHSQLT 
QVINELRNTS YRPKVCVLGS REQLCIHPEV KKQESNHLQI KLCRKKVASR SCHFYNNVEE 
KSLEQELASP ILDIEDLVKS GSKHRVCPYY LSRNLKQQAD IIFMPYNYLL DAKSRRAHNI 

10 DLKGTWIFD EAHNVEKMCE ESASFDLTPH DLASGLDVID QVLEEQTKAA QQGEPHPEFS 
-ADSPSPGLNM ELEDIAKLKM ILLRLEGAID AVELPGDDSG VTKPGSYIFE LFAEAQXTFQ 
TKGCILDSLD QIIQHLAGRA GVFTNTAGLQ KLADIIQIVF SVDPSEGSPG SPAGLGALQS 
YKVHIHPDAG HRRTAQRSDA WSTTAARKRG KVLSYWCFSP GHSMHELVRQ GVRSLILTSG 
TLAPVSSFAL EMQIPFPVCL ENPHIIDKHQ IWVGWPRGP DGAQLSSAFD RRFSEECLSS- 

15 LGKALGNIAR WPYGLLIFF PSYPVMEKSL EFWRARDLAR KMEALKPLFY EPRSKGSFSE - 
TISAYYARVA APGSTGATFL AVCRGKASEG LDFSDTNGRG VIVTGLPYPP RMDPRWLKM 
QFLDEMKGQG GAGGQFLSGQ EWYRQQASRA VNQAIGRVIR HRQDYGAVFL CDHRFAFADA 
RAQLPSWVRP HVRVYDNFGH VIRDVAQFFR VAERTMPAPA PRATAPSVRG EDAVSEAKSP 
GPFFSTRKAK SLDLHVPSLK QRSSGSPAAG DPESSLCVEY EQEPVPARQR PRGLLAALEH 

20 SEQRAGSPGE EQAHSCSTLS LLSEKRPAEE PRGGRKKIRL- VSHPEEPVAG AQTDRAKLFM 
VAVKQELSQA NFATFTQALQ DYKGSDDFAA LAACLGPLFA EDPKKHNLLQ GFYQFVRPHH 
KQQFEEVCIQ LTGRGCGYRP EHSIPRRQRA QPVLDPTGRT APDPKLTVST AAAQQLDPQE 
HLNQGRPHLS PRPPPTGDPG SQPQWGSGVP RAGKQGQHAV SAYLADARRA LGSAGCSQLL 
AALTAYKQDD DLDKVLAVLA ALTTAKPEDF PLLHRFSMFV RPHHKQRFSQ TCTDLTGRPY 

25 PGMEPPGPQE ERLAVPPVLT HRAPQPGPSR SEKTGKTQSK ISSFLRQRPA GTVGAGGEDA 
GPSQSSGPPH GPAASEWGL* (SEQ ID NO : 2 ) . 

3. An expression vector for expressing a NHL protein in a recombinant 
host cell wherein said expression vector comprises a DNA molecule of claim 2. 

30 

4. A host cell which expresses a recombinant NHL protein wherein said 
host cell contains the expression vector of claim 3.. 
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5. A process for expressing a NHL protein in a recombinant host cell, 
comprising: 

(a) transfecting the expression vector of claim 3 into a suitable host cell; and, 
5 (b) culturing the host cells of step (a) under conditions which allow 

expression of said NHL protein from said expression vector. 

6. A purified DNA molecule encoding a human NHL protein which 
consists of the amino acid sequence 

10 MPKIVLNGVT VDFPFQPYKC QQEYMTKVLE. CLQQKVNGIL ESPTGTGKTL CLLCTTLAWR 
EHLRDGISAR KIAERAQGEL FPDRALSSWG NAAAAAGDPI ACYTDIPKII YASRTHSQLT 
QVINELRNTS YRPKVCVLGS REQLCIHPEV KKQESNHLQI HLCRKKVASR SCHFYNNVEE 
KSLEQELASP ILDIEDLVKS GSKHRVCPYY LSRNLKQQAD IIFMPYNYLL DAKSRRAHNI 
DLKGTWIFD EAHNVEKMCE ESASFDLTPH DLASGLDVID QVLEEQTKAA QQGEPHPEFS 
15 ADSPSPGLNM ELEDIAKLKM ILLRLEGAID AVELPGDDSG VTKPGSYIFE LFAEAQITFQ 
TKGCILDSLD QIIQHLAGRA GVFTNTAGLQ KLADIIQIVF SVDPSEGSPG SPAGLGALQS 
■ YKVHIHPDAG HRRTAQRSDA WSTTAARKRG KVLSYWCFSP GHSMHELVRQ . GVRS LILTS G 
TLAPVSSFAL EMQIPFPVCL ENPHIIDKHQ IWVGWPRGP DGAQLSSAFD RRFSEECLSS 
LGKALGNIAR WPYGLLIFF PSYPVMEKSL EFWRARDLAR KMEALKPLFV EPRSKGSFSE 
20 TISAYYARVA APGSTGATFL AVCRGKASEG LDFSDTNGRG VIVTGLPYPP RMDPRWLKM . 
QFLDEMKGQG GAGGQFLSGQ EWYRQQASRA VNQAIGRVIR HRQDYGAVFL CDHRFAFADA , 
RAQLPSWVRP HVRVYDNFGH VIRDVAQFFR VAERTMPAPA PRATAPSVRG EDAVSEAKSP 
GPFFSTRKAK SLDLHVPSLK QRSSGSPAAG DPESSLCVEY EQEPVPARQR PRGLLAALEH 
SEQRAGSPGE' EQAHSCSTLS LLSEKRPAEE PRGGRKKIRL VSHPEEPVAG AQTDRAKLFM 
25 VAVKQELSQA NFATFTQALQ DYKGSDDFAA LAACLGPLFA EDPKKHNLLQ GFYQFVRPHH 
KQQFEEVCIQ LTGRGCGYRP EHSIPRRQRA QPVLDPTGRT APDPKLTVST AAAQQLDPQE 
HLNQGRPHLS PRPPPTGDPG SQPQWGSGVP RAGKQGQHAV SAYLADARRA LGSAGCSQLL 
AALTAYKQDD DLDKVLAVLA ALTTAKPEDF PLLHRFSMFV RPHHKQRFSQ TCTDLTGRPY 
PGMEPPGPQE ERLAVPPVLT HRAPQPGPSR SEKTGKTQSK ISSFLRQRPA GTVGAGGEDA 
30 GPSQSSGPPH GPAASEWGL* (SEQ ID NO: 2). 

7. An expression vector for expressing a NHL protein in a recombinant 
host cell wherein said expression vector comprises a DNA molecule of claim 6." 



-97- 



WO 01/42434 PCT/USOO/33065 



8. A host cell which expresses a recombinant NHL protein wherein said 
host cell contains the expression vector of claim 7. 

9. A process for expressing a NHL protein in a recombinant host cell, 
5 comprising: 

(a) transfecting the expression vector of claim 7 into a suitable host cell; and, 

(b) culturing the host cells of step (a) under conditions which allow 
expression of said NHL protein from said expression vector. 

!0 10. A purified DNA molecule which comprises the nucleotide sequence as 

set forth in SEQ ID NO: 1. 

11. An expression vector for expressing a NHL protein in a recombinant 
host cell wherein said expression vector comprises a DNA molecule of claim 10. 

15 

12. A host cell which expresses a recombinant NHL protein wherein said 
host cell contains the expression vector of claim 11. - 

13. A purified DNA molecule which consists of the nucleotide sequence as 
20 set forth in SEQ ID NO: 1. 

14. An expression vector for expressing a NHL protein in a recombinant 
host cell wherein said expression vector comprises a DNA molecule of claim 13. 

25 15. A host cell which expresses a recombinant NHL protein wherein said 

host cell contains the expression vector of claim 14. 

16. A purified DNA molecule of claim 13 which consists of the nucleotide 
sequence from about nucleotide 828 to about nucleotide 4587, as set forth in SEQ ID 

30 NO:l. 

17. An expression vector for expressing a NHL protein in a. recombinant 
host cell wherein said expression vector comprises a DNA molecule of claim 16. 
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18. A host cell which expresses a recombinant NHL protein wherein said 
host cell contains the expression vector of claim 17. 

19. A substantially purified NHL protein which comprises the amino acid 
5 sequence as set forth in SEQ ID NO:2. 

20. A substantially purified NHL protein which consists of the amino acid 
sequence as set forth in SEQ ID NO:2. 

10 21. A substantially purified NHL protein which comprises the amino acid 

sequence as set forth in SEQ ID NO:2, wherein said protein is a product of a DNA 
expression vector comprising SEQ ID NO:l and contained within a recombinant host 
cell. 

15 22. A method of identifying modulators of NHL activity, comprising: 

(a) combining a test compound with a NHL protein, wherein NHL comprises 
the amino acid sequence as set forth in SEQ ID NO:2; and, 

(b) measuring the effect of the test compound on the NHL protein. 

20 23. An isolated DNA molecule which comprises the nucleotide sequence 

. as set forth in SEQ ID NO:3. 

24. An isolated DNA molecule of claim 20 which comprises from about 
nucleotide 47000 to about nucleotide 85500 of SEQ ID NO:3. 

25 

25. An isolated DNA molecule of claim 23 which comprises from about 
nucleotide 47095 to about nucleotide 853 16 of SEQ ID NO:3. 

26. A substantially purified NHL protein of claim 21 wherein said protein 
30 is a product of a DNA expression vector comprising from about nucleotide 828 to 

nucleotide 4587, as set forth in SEQ ID NO:l, and contained within a recombinant 
host ceil. 
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*rr™rrrcT rTTnTAGfC AGTGCCGGGT GCTGGGGACT CAGGGAGGCC CGCCGGGACC ACTGCGGGAC 

si IE Ss SS2SS SSK 2ES2 see 
HE S Ess SSSSS EKE SSSS ssssss 

SE iSE ss gs = iE iii 

■linn 

IE iS iii Ss sees 
Sii S S ssss esse see ssss 
SI S'Eb ssasss ssss sssss see 

SfS?£? Sggga cagtcgtgat ctttgacgaa GCTCACAACG TGGAGMGAT GTGTGAAGAA . 

T^ATttT TTCACOGK TCCCCATGAC CTGGCTTCAG GACTGGACGT CATAGACCAG GTGCTGGAGG 
S SSS£ CAGGGTGAGC CCCACCCGGA GnCAOCGCG GAaCCCCCA = 
PAArflTrrAfi TTGGAAGACA TTGCAAAGCT GAAGATGATC CTGCTGCGCC TGGAGGGGGC UUUaAibU 
™™ ctSga Sgcggtctc ACCAAGCCAG GGAGCTACAT CTTTGAGCTG TTTGCTGMG 
JS SttwScc MGGGCTGCA TCCTGGACTC GCTGGACCAG ATCATCCAGC ACCTGGCAGG 
S SStSoa ACACGGCCGG ACTGCAGAAG CTGGCGGACA TTATCCAGAT TGTGTTCAGT 
S Sgcag CCCTGGTTCC CCAGCAGGGC tgggggcctt ACAGTCCTAT AAGGTGCACA 
SS™t£ TGCTGGTCAC CGGAGGACGG GTCAGCGGTC TGATGCCTGG AGCACCACTG CAGCCAGAAA 
S SSSfiS AcScTT CAGTCCCGGC CACAGCATGC ACaAGCTGGT CCgCAGGGC 
GTCCGCTCCC TCATCCTTAC CAGCGGCACG CTGGCCCCGG TGTCCTCCTT TGCTCTGGAG ATGCAGATCC 
" rrr . rT rTKrrrGGAG AACCCACACA TCATCGACAA GCACCAGATC TGGGTGGGGG TCGTCCCCAG 
£K SS SfcCGC GTTT6ACAGA CGGTTTTCCG AGGAGTGCTT KCaKCK 
GGGAAGGCTC TGGGCAACAT CGCCCGCGTG GTGCCCTATG GGGTCCTGAT CTTCTTCOT TCCTATCCTG 
TCATGGAGAA GAGCCTGGAG TTCTGGCGGG CCCGCGACTT. GGCCAGGAAG ATGGAGGCGC JGAAGCCGCT 
GTTTGTGGAG CCCAGGAGCA AAGGCAGCTT CTCCGAGACC ATCAGTGCTT ACTATGCMG GGHGCCGCC 
CCTGGGTCCA CCGGCGCCAC CTTCCTGGCG GTCTGCCGGG GCAAGGCCAG CGHESGCTG GACTTCTCAG 
ACACGAATGG CCGTGGTGTG ATTGTCACGG GCCTCCCGTA CCCCCCACGC ATGGACCCCC GGGTTGTCCT 
CAAGATGCAG SoS AGATGAAGGG CCAGGGTGGG GCTGGGGGCC AGTTCCTCTC TGGGC GG^G 
TGGTACCGGC AGCAGGCGTC CAGGGCTGTG AACCAGGCCA TCGGGCGAGT 6ATCCGGCAC CGCCAGGACT 
ACGGAGCTGT CTTCCTCTGT GACCACAGGT TCGCCTTTGC CGACGCAAGA GCCCAACTGC CTCCTGGGT 
GCGTCCCCAC GTCAGGGTGT ATGACAACTT TGGCCATGTC ATCCGAGACG TGGCCCAGTT CTTCCffTCTT 
GCCGAGCGM CTATGCCAGC GCCGGCCCCC CGGGCTACAG CACCCAGTGT GCGTGGAGAA ^TGCTGTCA. 
GCGAGGCCAA GTCGCCTGGC CCCTTCTTCT CCACCAGGAA AGCTAAGAGT CTGGACCTGC ATGTCCCCAG 
CCTGAAGCAG AGGTCCTCAG GGTCACCAGC TGCCGGGGAC CaGJGAfiTA 
- CAGGAGCCAG TTCCTGCCGG GCAGAGGCCC AGGGGGCTGC TGGCCGCCCT GGAGCACAGC GAACAGCGGG 

FI G J A 
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CGGG6A6CCC TGGCGAGGAG CAGGCCCACA GCTGCTCCtt CCTGTCCCTC GTGTCTGAGA 
IrArrrrrrr AGAAGAACCG CGAGGAGGGA GGAAGAAGAT CCGGCTGGTC AGOALUtta 
S Sgca CAGACGGACA GGGCCAAGCT cttcatggtg GCCGTGMGC 
AP^rS CCAAGCCAAC TTTGCCACCT TCACCCAGGC CCTGCAGGAC TACAAGGGTT 
S ScMOCTC G CGCCTGTC TCGGCCCCCT CTTTGCTGAG GACCCCAAGA 
GCTCCMGGC TTCTACCAGT TTGTGCGGCC CCACCATAAG CAGCAGTTTG 

====== 

AGC I^rI SrrSrrrr AGGCCACCCC CAAGAGGAGA CCCTGGCAGC CAACCACAGT 

S SS S a™ = ™ 

™ TPrrfT fAfificrcaG GGGTCCGCGG GCTGTAGCCA ACTCTTGGCA bLbUbAUJb 
SS CTCGACAAGG TGCTGGCTGT G = CTGACCAOG 

SffiS SEES SS'SSSS ™ ™ 

£S S AGGCTTGCCG TGCCTCCTGT GCTTACCCAC AGGGCTO C 
■A^rAfirrrr CTCACGGTCC GAGAAGACCG GGAAGACCCA GAGCAAGATC TCGTCCTTCC 

SS Tctgtggggg cgggcggtga ggatgcaggt cccagccact 

rnc^rncr TfCCCACGGG CCTGCAGCAT CTGAGTGGGG CCTCIAGGAT GTGCCCAGCC 
S SS GCAGAGCGTC ATGCAGGTCT t™ G = GA 
GTGCCCACGG AGGCCCCCAG CACACCCAAC GTGGCTTGAT CACCTGCCTG ]£GAGCTCrc 
GTGGGCCAAG AACCCACCCA ACAGMTAGG CCAGCCCATG CCAGCC^T TQSCODBCre 
CAGGCCTCAG GCAGGCGGGG CCCATGGTTG GTCCCTGCGG ™^.T™?^ 
- mCTGAGAA GCCCTGAGCT ACCTTGGGGT CTGGGGTGGG TTTCTGGGAA AGTGCTTCCC 
CTGGCTCCTG GCCTGTGAGT GGTGCCACAG G^CACCCCA GCTOAGCC C 
TCACGGGGAA GGAGGAGACC CCCGTGGGCA CGTGTCCACT TTTAATCAGG GGACAGGGCT 
CTCTAATAAA GCTGCTGGCA GTGCCC CSEQ ID N0:1). 

FIG.1B 
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=5 55E ESI rs; «™s ssss 

•"ii EE ssi sss n= ssss 

iiliil 

S DYKGSODFM LAACLGPLFA EDPKKHNIL.Q GFYQFVRPHH 

SfvhS S EHSIPRRQRA qpvldptgrt apopkltvst amqqldpqe 

S TO™ SQPWGSGVP RAGKQGQHAV SAYLADARRA LGSAGCSQLL 
^S™ DLDKVLAVLA ALTTAKPEDF PLLHRFSHFV RPHHKQRFSQ TCTDLTGRPY 
E^UWPPVLT HRAPQPGPSR SEKTGKTQSK ISSFLRQRPA GTVGAGGEDA 
GPSQSSGPPH GPAASEWGL* (SEQ ID N0:2). 



FIG. 2 
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SEQUENCE LISTING 



<110> APPLICANT: Merck & Co., Inc. 

<120> TITLE: DNA MOLECULES ENCODING HUMAN NHL , A DNA 
HELICASE 

<130> DOCKET/ FILE REFERENCE: 2058*5 PCT 
<160> NUMBER OF SEQUENCES: 38 

<170> SOFTWARE: FastSEQ for Windows Version 4.0 

<210> SEQ ID NO:l 
<211> LENGTH: 4946 - 
<212> TYPE: DNA 
<213> ORGANISM: Homo sapien 

<220> FEATURE: 
<221> NAME /KEY: CDS 
<222> LOCATION: (828) 

<400> SEQ ID NO:l 
agtcagccct gctgccagcc 
actgcgggac agtgagccga 
cagggctctc aggagccggg 
cggtcggctc acactcgagc 
ccgggcaatg gtggtggctc 
gaatgacatc aagggaggag 
tgaggttcac tggccagcgg 
ggagccgaag agcaggcgga 
caacggaccg gaagtggggg 
cccgcgaaaa ctctgagctg 
ggttgagttc ctgagggacc 
tgtgccatag ggattctcga 
ttaccaggag tgcccgagac 
agcctgcccc tcagccacgc 

wee fro bys 
1 

ata gtc ctg aat ggt gtg acc gta gac tec cct ttc cag ccc tac aaa 884 
He Val Leu Asn. Gly Val Thr Val Asp. Phe Pro Phe Gin Pro Tyr Lys 
5 10 15 

tgc caa cag gag tac atg acc aag gtc ctg gaa tgt ctg cag cag aag 932 
Cys Gin Gin Glu Tyr Met Thr Lys Val Leu Glu Cys Leu Gin Gin Lys 
20 ' 25 30 35 



. . . (4487) 



agtgccgggt gctggggact cagggaggee cgccgggacc 60 

gcagaagctg gaacgcagga gaggaaggag agggggeggt 120 

tcctgggcaa ggcgcagccg ttttcaaatt ttcaggaaag 180 

agtaaaaaga tgcctctggg gaggaggece gtgeagctet 240 

ggectagaga ggcggtagtg gaaegcagae cctggtgggg 3 00 

aegggeggga ccccagattt ctgcctgtgg gcgatggaag 3 60 

agccggacac agaacgegea aaacgccgtg. taggcctgga 420 

ccccctccgc gggggaacag tttccgccgg gagcacaaag 480 

gcggaagtgc agtgggctca gcgccgactg cgcgcctctg 540 

gctgacagct ggggacgggt ggcggccctc gactggagtc 600 

ccggttctgg aaggttcgcc gcggagacaa gtgagcagtc 660 

agagaacagc gttgtgtccc agtgcacatg ctcgcatcgc 720 

cctaagatgt tcggagtggt tttttegcac agacccgaat 780 

tctgtgccct tctgagaaca ggctgat atg ccc aag 83 6 



gtg aat ggc ate ctg gag age cct acg ggt aca ggg aag acg ctg tgc 

Val Asn Gly He Leu Glu Ser Pro Thr Gly Thr Gly Lys Thr Leu Cys 

40 45 50 

ctg ctg tgc acc acg ctg gec tgg cga gaa cac etc cga gac ggc ate 

Leu Leu Cys Thr Thr Leu Ala Trp Arg Glu His Leu Arg Asp Gly He 
55/ 60 '65 

tct gec cgc aag att gec gag agg gcg caa gga gag ctt ttc ccg gat 

Ser Ala Arg Lys He Ala Glu Arg Ala' Gin Gly Glu Leu Phe Pro Asp 
70 75 80 



980 



1028 



1076 
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egg gec ttg tea tec tgg ggc aac get get get get get gga gac ccc 1124 
Arg Ala Leu Ser Ser Trp Gly Asn Ala Ala Ala Ala Ala Gly Asp Pro 
85 90 95 

ata get tgc tac acg gac ate cca aag att att tac gee tec agg ace 1172 
He. Ala Cys Tyr Thr Asp He Pro Lys lie He Tyr Ala Ser Arg Thr 
100 105 HO 115 

cac teg caa etc aca cag gtc ate aac gag ctt egg aac acc tec tac 1220 
His Ser Gin Leu Thr Gin Val lie Asn- Glu Leu Arg Asn Thr Ser Tyr 
120 125 130 



egg cct aag gtg tgt gtg ctg ggc tec egg gag cag ctg tgc ate cat 
Arg Pro Lys Val Cys Val Leu Gly Ser Arg Glu Gin Leu Cys He His 
135 140 145 



tec egg aac ctg aag cag caa gec gac ate ata ttc atg ccg tac aat 
Ser Arg Asn Leu Lys Gin Gin Ala Asp He He Phe Met Pro Tyr Asn 
215 220 . 225 



ggg aca gtc gtg ate ttt gac gaa get cac aac gtg gag aag atg tgt 
Gly Thr Val Val lie Phe Asp Glu Ala His Asn Val Glu Lys Met Cys 
245 250 255 

gaa gaa teg gca tec ttt gac ctg act ccc cat gac ctg get tea gga 
Glu Glu Ser Ala Ser Phe Asp Leu Thr Pro His Asp Leu Ala Ser Gly 
260 265 270 275 

ctg gac gtc ata gac cag gtg ctg gag gag cag acc aag gca gcg cag 
Leu Asp Val He Asp Gin Val Leu Glu Glu Gin Thr Lys Ala Ala Gin 
280 285 290 

cag ggt gag ccc cac ccg gag ttc age gcg gac tec ccc age cca ggg 
Gin Gly Glu Pro His Pro Glu Phe Ser Ala Asp Ser Pro Ser Pro Gly 
295 300 305 



1268 



cct gag gtg aag aaa caa gag agt aac cat eta cag ate cac ttg tgc 1316 
Pro Glu Val Lys Lys Gin Glu Ser Asn His Leu Gin He His Leu Cys 
150 155 160 

cgt aag aag gtg gca agt cgc tec tgt cat* ttc tac aac aac gta gaa 1364 
Arg Lys Lys Val Ala Ser Arg Ser Cys His Phe Tyr Asn Asn Val Glu 
165. 170 175 

gaa aaa age ctg gag cag gag ctg gec age ccc ate ctg gac att gag 1412 
Glu Lys Ser Leu Glu Gin Glu Leu Ala Ser Pro He Leu Asp He Glu 
180 185 190 195 

gac ttg gtc aag age gga age aag cac agg gtg tgc cct tac tac ctg 1460 
Asp Leu Val Lys Ser Gly Ser Lys His Arg Val Cys Pro Tyr Tyr Leu 
200 205 210 



1508 



tac ttg ttg gat gee aag age cgc aga gca cac aac att gac ctg aag 1556 
Tyr Leu Leu Asp Ala Lys Ser Arg Arg Ala His Asn He Asp Leu Lys 
230 235 ' 240 



1604 



1652 



1700 



1748 



ctg aac atg gag ctg gaa gac att gca aag ctg aag atg ate ctg. ctg 1796 
Leu Asn Met Glu Leu Glu Asp He Ala Lys Leu Lys Met lie Leu Leu 
310 315 320 



2 
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cgc ctg gag ggg gcc ate gat get gtt gag ctg cct gga gac gac age 
Arg Leu Glu Gly Ala lie Asp Ala Val Glu Leu Pro Gly Asp Asp Ser 



325 



330 335 



ggt gtc acc aag cca ggg age tac ate ttt gag ctg ttt get gaa gee 
Gly Val Thr Lys Pro Gly Ser Tyr lie Phe Glu Leu Phe Ala Glu Ala 
345 350 355 



340 



420 



cag egg tct gat gcc tgg age acc act gca gcc aga aag cga ggg aag 
Gin Arg Ser Asp Ala Trp Ser Thr Thr Ala Ala Arg Lys Arg Gly Lys 

445 450 



440 



gtg ctg age tac tgg tgc ttc agt ccc ggc cac age atg cac gag ctg 
Val Leu Ser Tyr Trp Cys Phe Ser Pro Gly His Ser Met His Glu Leu 

460 465 



455 



500 



505 510 515 



gtc ccc aga ggc ccc gat gga gcc cag ttg age tec gcg ttt gac aga 
Val Pro Arg Gly Pro Asp Gly Ala Gin Leu Ser Ser Ala Phe Asp Arg 
520 525 530 

egg ttt tec gag gag tgc tta tec tec ctg ggg aag get ctg ggc aac 
Arg Phe Ser Glu Glu Cys Leu Ser Ser Leu Gly Lys Ala Leu Gly Asn 
535 540 545 

ate gcc cgc gtg gtg ccc tat ggg etc ctg. ate ttc ttc cct tec tat 
He Ala Arg Val Val Pro Tyr Gly Leu Leu He Phe Phe Pro Ser Tyr 
550 5.55 560 



1344 



1392 



1940. 



cag ate acg ttt cag acc aag ggc tgc ate ctg gac teg ctg gac cag 
Gin lie Thr Phe Gin Thr Lys Gly Cys He Leu Asp Ser Leu Asp Gin 
360 365- 370 

ate ate cag cac ctg gca gga cgt get gga gtg ttc acc aac acg gcc 
He He Gin His Leu Ala Gly Arg Ala Gly Val Phe Thr Asn Thr Ala 
375 380 . 385 

gga ctg cag aag ctg gcg gac att ate cag att gtg ttc agt gtg gac 
Gly Leu Gin Lys Leu Ala Asp He He Gin He Val Phe Ser Val Asp 
390 395 400 

ccc. tec gag ggc age cct ggt tec cca gca ggg ctg ggg gcc tta cag 
Pro Ser Glu Gly Ser Pro Gly Ser Pro Ala Gly Leu Gly Ala Leu Gin 
405 410 415 

tec tat aag gtg cac ate cat cct gat get ggt cac egg agg acg get 2132 
Ser Tyr Lys Val His He His Pro Asp Ala Gly His Arg Arg Thr Ala 
425 430 435 



1988 



2036 



2084 



2180 



2228 



2276 



2324 



gtc cgc cag ggc gtc cgc tec etc ate ctt acc age ggc. acg ctg gcc 
Val Arg Gin Gly Val Arg Ser Leu He Leu Thr Ser Gly Thr Leu Ala 
470 475 480 

ccg gtg tec tec ttt get ctg gag atg cag ate cct ttc cca gtc tgc 
Pro Val Ser Ser Phe Ala Leu Glu Met Gin He Pro Phe Pro Val Cys 
435 490 495 

ctg gag aac cca cac ate ate gac aag cac cag ate tgg gtg ggg gtc 2372 
Leu Glu Asn Pro His He He Asp Lys His Gin He Trp Val Gly Val 



2420 



2468 



2516 
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cct gtc atg gag aag age ctg gag ttc tgg egg gec cgc gac ttg gec 
Pro Val Met Glu Lys Ser Leu Glu Phe Trp Arg Ala Arg Asp Leu Ala 
565 570 575 . 



2564 



agg aag atg gag gcg ctg aag ccg ctg ttt gtg gag ccc agg age aaa 2 612 
Arg Lys Met Glu Ala Leu Lys Pro Leu Phe Val Glu Pro Arg Ser Lys 
580 585 590 595 

ggc age ttc tec gag acc ate agt get tac tat gca agg gtt gec gec 2660 
Gly Ser Phe Ser Glu Thr He Ser Ala Tyr Tyr Ala Arg Val Ala Ala 
600 605 610 



cct ggg tec acc ggc gec acc ttc ctg gcg gtc tgc egg ggc aag gec 
Pro Gly Ser Thr Gly Ala Thr Phe Leu Ala Val Cys Arg Gly Lys Ala 
615 620 625 



aac cag gec ate ggg cga gtg ate egg cac cgc cag gac tac gga get 
Asn Gin Ala He Gly Arg Val He Arg His Arg Gin Asp Tyr Gly Ala 
695 700 705 



2708 



•age gag ggg ctg gac ttc tea gac acg aat ggc cgt ggt gtg att gtc . 2756 
Ser Glu Gly Leu' Asp Phe Ser Asp Thr Asn Gly Arg Gly Val He- Val 
630 635 640 

acg ggc etc ccg tac ccc cca cgc atg gac ccc egg gtt gtc etc aag 2804 
Thr Gly Leu Pro Tyr Pro Pro Arg Met Asp Pro Arg Val Val Leu Lys 
645 650 655 

atg cag ttc ctg gat gag atg aag ggc cag ggt ggg get ggg ggc cag. 
Met Gin Phe Leu Asp Glu Met Lys Gly Gin Gly Gly Ala Gly Gly Gin 
660 665 670 675 

ttc etc tct ggg cag gag tgg tac egg cag cag gcg tec agg get gtg 2900 
Phe Leu Ser Gly Gin Glu Trp Tyr Arg Gin Gin Ala Ser Arg Ala Val 
680 685 690 



2852 



2948 



gtc ttc etc tgt gac cac agg ttc gee ttt gee gac gca aga gec caa 2996 

Val Phe Leu Cys Asp His Arg Phe Ala Phe Ala Asp Ala Arg Ala Gin - 

710 715 720 

ctg ccc tec tgg gtg cgt ccc cac gtc agg gtg tat gac aac ttt ggc 

Leu Pro Ser Trp Val Arg Pro His Val Arg Val Tyr Asp Asn Phe Gly 
725 730 735 

cat gtc ate cga gac gtg gec cag ttc ttc cgt gtt gee gag cga act 

His Val He Arg Asp Val Ala Gin Phe Phe Arg Val Ala' Glu Arg Thr 
740 745 750 755 

atg cca gcg ccg gec ccc egg get aca gca ccc agt gtg cgt gga gaa 

Met Pro Ala Pro Ala Pro Arg Ala Thr Ala Pro Ser Val Arg Gly Glu 
760 765 770 



3044 



3092 



3140 



gat get gtc age gag gee aag teg cct ggc ccc ttc ttc tec acc agg 3188 
Asp Ala Val Ser Glu Ala Lys Ser Pro Gly Pro Phe Phe Ser Thr Arg 
775 780 785 

aaa get aag agt ctg gac ctg cat gtc ccc age ctg aag cag agg tec 3236 
Lys Ala Lys Ser Leu Asp. Leu His Val Pro Ser Leu Lys Gin Arg Ser 
790 795 ' 800 
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tea ggg tea cca get gec ggg gac ccc gag agt age ctg tgt gtg gag 3234 
Ser Gly Ser Pro Ala Ala Gly Asp Pro Glu Ser Ser Leu Cys Val Glu 
805 810 815 

tat gag cag gag cca gtt cct gec egg cag agg ccc agg ggg ctg ctg 3332 
Tyr Glu Gin Glu Pro Val Pro Ala Arg Gin Arg Pro Arg Gly Leu Leu 
820 825 830 835 

gec gec ctg gag cac age gaa cag egg gcg ggg age cct ggc gag gag 3380 
Ala Ala Leu Glu His Ser Glu Gin Arg Ala Gly Ser Pro Gly Glu Glu 
840 845 850 

cag gec cac age tgc tec acc ctg tec etc ctg tct gag. aag agg ccg 3428 
Gin Ala His .Ser Cys Ser Thr Leu Ser Leu Leu Ser Glu Lys Arg Pro 
855 8'60 , 865 

gca gaa gaa .ccg cga gga ggg agg aag aag ate egg ctg gtc age cac 3476 
Ala Glu Glu Pro' Arg Gly Gly Arg Lys Lys He Arg Leu Val Ser His 
870 875 880 



ccg gag gag ccc gtg get ggt gca cag acg gac agg gec aag etc ttc 3524 
Pro Glu Glu Pro Val Ala Gly Ala Gin Thr Asp Arg. Ala Lys Leu Phe 
885 890 895 

atg gtg gee gtg aag cag gag ttg age caa gee aac ttt gee acc ttc 3572 
Met Val Ala Val Lys Gin Glu Leu Ser Gin Ala Asn Phe Ala Thr Phe 
900 905 910- 915 

acc cag gee ctg cag gac tac aag ggt tec gat gac ttc gee gee ctg 3620 
Thr Gin Ala Leu Gin Asp Tyr Lys Gly Ser Asp Asp Phe Ala Ala Leu 
920 925 930 

gec gee tgt etc ggc ccc etc ttt get gag gac ccc aag aag cac aac 
Ala Ala Cys Leu Gly Pro Leu Phe Ala Glu Asp Pro Lys Lys His Asn 
935 940 945 

ctg etc caa ggc ttc tac cag ttt gtg egg ccc cac cat aag cag cag 
Leu Leu Gin Gly Phe Tyr Gin Phe Val Arg Pro His His Lys Gin Gin 
950 955 960 

ttt gag gag gtc tgt ate cag ctg aca gga cga ggc tgt ggc tat egg 
Phe Glu Glu Val Cys lie Gin Leu Thr Gly Arg Gly Cys Gly Tyr Arg 
965 970 975 . , 

cct gag cac age att ccc cga agg cag egg gca cag ccg gtc ctg gac 3812 
Pro Glu His Ser He Pro Arg Arg Gin Arg Ala Gin Pro Val Leu Asp 
980 985 990 995 



ccc act gga aga acg gcg ccg gat ccc aag ctg acc gtg tec acg get 
Pro Thr Gly Arg Thr Ala Pro Asp Pro Lys Leu Thr Val Ser Thr Ala 
1000 1005 1010 

gca gec cag cag ctg gac ccc caa gag cac ctg aac cag ggc agg ccc 
Ala Ala Gin Gin Leu Asp Pro Gin Glu His Leu Asn Gin Gly Arg Pro 
1015 1020 1025 



36*68 



3716 



3764 



3860 



3908 



cac ctg teg ccc agg cca ccc cca aca gga gac cct ggc age caa cca 3956 
His Leu Ser Pro Arg Pro Pro Pro Thr Gly Asp Pro Gly Ser Gin Pro 
1030 1035 . 1040 
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cag tgg ggg tct gga ,gtg ccc aga gca ggg aag cag ggc cag cac gcc 
Gin Trp Gly Ser Gly VaL Pro Arg Ala Gly Lys Gin Gly Gin His Ala 
1045 1050 1055 

gcg-agc gcc tac ctg get gat gcc cgc agg gcc ctg ggg tec gcg ggc 
Val Ser Ala Tyr Leu Ala Asp Ala Arg Arg Ala Leu Gly Ser Ala Gly 
1060 1065- 1070 1075 

tgt age caa etc ttg gca gcg ctg aca gcc tat aag caa gac gac gac 
Cys Ser Gin Leu Leu Ala Ala Leu Thr Ala Tyr Lys Gin Asp Asp Asp 
1080 1085 1090 

etc gac aag gtg ctg get gtg ttg gcc gcc ctg acc act gca aag cca 
Leu Asp Lys. Val Leu Ala Val Leu Ala Ala Leu Thr Thr. Ala Lys- Pro 
1095 1100 1105 



cag tec tea gga cct ccc cac ggg cct gca gca tct gag tgg ggc etc 
Gin Ser Ser Gly Pro Pro His Gly Pro Ala Ala Ser Glu Trp Gly Leu 
1205 1210 1215 



4004 



4052 



4100 



4148 



gag gac. ttc ccc ctg ctg. cac agg ttc age atg ttt gtg cgt cca cac 4196 
Glu Asp Phe Pro Leu Leu His Arg Phe Ser Met Phe Val Arg Pro His 
1110 1H5 1120 

cac aag cag cgc ttc tea cag acg tgc aca gac ctg acc ggc egg- ccc • 
His Lys Gin Arg Phe Ser Gin Thr Cys Thr Asp Leu Thr Gly Arg Pro . 
1125 1130 1135 

tac ccg ggc atg gag cca ccg gga ccc cag gag gag agg ctt gcc gtg 
Tyr Pro Gly Met Glu Pro Pro Gly Pro Gin Glu Glu Arg Leu Ala Val 
1140 1145 ' 1150 1155 

-cct cct gtg ctt acc cac agg get ccc caa cca ggc ccc tea egg tec 
Pro ^ro Val Leu Thr His Arg Ala Pro Gin Pro Gly Pro Ser Arg Ser 
1160 H65 1170 

gag aag acc ggg aag acc cag age aag ate teg tec ttc ctt aga cag 
Glu Lys Thr Gly Lys Thr Gin Ser Lys He Ser Ser Phe Leu Arg Gin 
1175 11-80 1185 



4244 



4292 



4340 



4388 



agg cca gca ggg act gtg ggg gcg ggc ggt gag gat gca ggt ccc age 4436 
Arg Pro Ala Gly Thr Val Gly Ala Gly Gly Glu Asp Ala Gly Pro Ser 
1190 H95 1200 



4484 



tag gatgtgccca gcctgccaca ccgcctccag gaagcagagc gtcatgeagg 4537 



tcttctggcc agagccccag tgagtgccca cggaggcccc cagcacaccc aacgtggctt 4597 

gatcacctgc ctgtccagct ctggtgggcc aagaacccac ccaacagaat aggccagccc 4657 

atgccagccg gcttggcccg ctgcaggcct caggcaggcg gggcccatgg ttggtccctg 4717 

cggtgggacc ggatctgggc ctgcctctga gaagccctga gctaccttgg ggtctggggt 4777 

gggtttctgg gaaagtgctt ccccagaact tccctggctc ctggcctgtg agtggtgcca 4837 

caggggcacc ccagctgagc ccc.tcaccgg gaaggaggag acccccgtgg gcacgtgtcc 4897 

acttttaatc aggggacagg gctctctaat aaagctgctg . gcagtgccc 4946 
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<210> SEQ ID NO: 2 

<211> LENGTH: 1219 

<212> TYPE : PRT 

<213> ORGANISM: Homo sapien 



<400> SEQ ID NO: 2 

Met Pro Lys He Val Leu Asn Gly Val Thr Val Asp Phe Pro, Phe Gin 

1 5 10 15 

Pro Tyr Lys Cys Gin Gin Glu Tyr Met Thr Lys Val Leu Glu Cys Leu 

20 25 30 

Gin Gin Lys Val Asn Gly He Leu. Glu Ser Pro Thr Gly Thr Gly Lys 

35 40 45 

Thr Leu Cys Leu Leu Cys Thr Thr Leu Ala Trp Arg Glu His. Leu Arg 

SO 55 60 

Asp Gly He Ser Ala Arg Lys lie Ala Glu Arg Ala Gin Gly Glu Leu 
65 70 75 80 

Phe Pro Asp Arg Ala Leu Ser Ser Trp Gly Asn Ala Ala Ala Ala Ala 

85 90 95 

Gly Asp Pro He Ala Cys Tyr Thr Asp He Pro Lys He He Tyr Ala 

100 105 HO 

Ser Arg Thr His Ser Gin Leu Thr Gin Val He Asn Glu Leu Arg Asn 

115 120 125 

Thr Ser Tyr Arg Pro Lys Val Cys Val Leu Gly Ser Arg Glu Gin Leu 

130 . 135 140 

Cys He His Pro Glu Val Lys Lys Gin Glu Ser Asn His Leu Gin He 
145 150 155 160 

His Leu Cys Arg Lys Lys Val Ala Ser Arg Ser Cys His Phe Tyr Asn 

165 170' 175 

Asn Val Glu Glu Lys Ser Leu Glu Gin Glu Leu Ala Ser Pro He Leu 

130 185 190 

Asp He Glu Asp Leu Val Lys Ser Gly Ser Lys His Arg Val Cys Pro 

195 200 205 

Tyr Tyr Leu Ser Arg Asn Leu Lys Gin Gin Ala Asp He He Phe Met 

210 215 220 

Pro Tyr Asn' Tyr Leu Leu Asp Ala Lys Ser Arg Arg Ala His Asn lie 
225 230 235 240 

Asd Leu Lys Gly Thr Val Val lie Phe Asp Glu Ala His Asn Val Glu 

245 250 255 

Lys Met Cys Glu Glu Ser Ala Ser Phe Asp Leu Thr Pro His Asp Leu 

260 265 270 

Ala Ser Gly Leu Asp Val He Asp Gin Val Leu Glu Glu Gin Thr Lys 

275 280 285 

Ala Ala Gin Gin Gly Glu Pro His Pro Glu Phe Ser Ala Asp Ser Pro 

290 295 300 

Ser Pro Gly Leu Asn Met Glu Leu Glu Asp lie Ala Lys Leu Lys Met 
305 310 315 320 

He Leu Leu Arg Leu Glu Gly Ala He Asp Ala Val Glu Leu Pro Gly 

325 330 335 

Asp Asp Ser Gly Val Thr Lys Pro Gly Ser Tyr He Phe Glu Leu Phe 

340 345 350 

Ala Glu Ala Gin lie Thr Phe Gin Thr Lys Gly Cys lie Leu Asp Ser 

355 360 365 

Leu Asp Gin lie lie Gin His Leu Ala Gly Arg Ala Gly Val Phe Thr 

370 • 375 380 

Asn Thr Ala Gly Leu Gin Lys Leu Ala Asp He lie Gin He Val Phe 
385 390 395 400 

Ser Val Asp Pro Ser Glu Gly Ser Pro Gly Ser Pro Ala Gly Leu Gly 

405 410 415 

Ala. Leu Gin Ser Tyr Lys Val His He His Pro Asp Ala Gly His Arg 
420 4-25 43 0 
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Arg Thr Ala Gin Arg Ser Asp Ala Trp Ser Thr Thr Ala Ala.Arg Lys 

435 440 445 

Arg Gly Lys Val Leu Ser Tyr Trp Cys Phe Ser Pro Gly His Ser Met 

450 455 460 

His Glu Leu Val Arg Gin Gly Val Arg Ser Leu He Leu Thr Ser Gly 
465 470 475 480 

Thr Leu Ala Pro Val Ser Ser Phe Ala Leu Glu Met Gin He Pro Phe 

485 490 - 495 

Pro Val Cys Leu Glu Asn Pro His He He Asp Lys- His Gin He Trp 

500 505 510 

Val Gly Val Val Pro Arg Gly Pro Asp Gly Ala Gin Leu Ser Ser Ala 

515 520 525 

Phe. Asp Arg Arg Phe Ser Glu Glu Cys Leu Ser Ser Leu Gly Lys Ala 

530 535 540 

Leu Gly Asn He Ala Arg Val Val Pro Tyr Gly Leu Leu He Phe Phe 
545 550 555 560 

Pro Ser Tyr Pro Val Met Glu Lys Ser Leu Glu Phe Trp Arg Ala Arg 

56$ 570 575 

Asp Leu Ala Arg Lys Met Glu Ala Leu Lys Pro Leu Phe Val Glu Pro 

580 585 590 

Arg Ser Lys Gly Ser Phe Ser Glu Thr He Ser Ala Tyr Tyr Ala Arg 

595 600 605 

Val Ala Ala Pro Gly Ser Thr Gly Ala Thr Phe Leu Ala Val Cys Arg 

610 615 620 

Gly Lys Ala Ser Giu Gly Leu Asp Phe Ser Asp Thr Asn Gly Arg Gly 
625 - 630 635 640 

Val He Val Thr Gly Leu Pro Tyr Pro Pro Arg Met Asp Pro Arg Val 

645 650 655 

Val Leu Lys Met Gin Phe Leu Asp Glu Met Lys Gly Gin Gly Gly Ala 

6-60 665 670 

Gly Gly Gin Phe Leu Ser Gly Gin Glu Trp Tyr Arg Gin Gin Ala Ser 
675 680 685 

" Arg Ala Val Asn Gin Ala He Gly Arg Val He Arg His Arg Gin Asp 
690 695 700 

Tyr Gly Ala Val Phe Leu Cys Asp His Arg Phe Ala Phe Ala Asp Ala 
705 710 715 720 

Arg Ala Gin Leu Pro Ser Trp Val Arg Pro His Val Arg Val Tyr Asp 

725 730 735 

Asn Phe Gly His Val lie Arg Asp Val Ala Gin Phe Phe Arg Val Ala 

740 745 750 

Glu Arg Thr Met Pro Ala Pro Ala Pro Arg Ala Thr Ala Pro Ser Val 

755 760 765 

Arg Gly Glu Asp Ala Val Ser Glu Ala Lys Ser Pro Gly Pro Phe Phe. 

770 775 780 

Ser Thr Arg Lys Ala Lys Ser Leu Asp Leu His Val Pro Ser Leu Lys 
785 790 795 800 

Gin Arg Ser Ser Gly Ser Pro Ala Ala Gly Asp Pro Glu Ser Ser Leu 

805 810 815. 

Cys Val Glu Tyr Glu Gin Glu Pro Val Pro Ala Arg Gin Arg Pro Arg 

820 825 830 

Gly Leu Leu Ala Ala Leu Glu His Ser Glu Gin Arg Ala Gly Ser Pro 

835 840 845 

Gly Glu Glu Gin Ala His Ser Cys Ser Thr Leu Ser Leu Leu Ser Glu 

•850 855 860 

Lys Arg Pro' Ala Glu Glu Pro Arg Gly Gly Arg Lys Lys He Arg Leu 
865 870 875 880 

Val Ser His Pro Glu Glu Pro Val Ala Gly Ala Gin Thr Asp Arg Ala 

' 885 890 895 

Lys Leu Phe Met Val Ala Val Lys Gin Glu Leu Ser Gin Ala Asn Phe 
900 905 910 
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Ala 


Thr 


Phe 


Thr 


bin 


rt.±a. ilCU 0x11 


Asp 




Lvs 


Gly Ser Asp 


Asp 


Phe 




9lD 






920 








925 






Ala 


Ala 


Leu 


Ala 


Aia 


Gys Leu Gly 


Pro 


Leu 


Phe 


Ala Glu Asp 


Pro 


Lvs 




930 








935 








940 






Lys 


His 


Asn 


Leu 


Leu 


Gin Gly Phe- 


Tyr 




Phe 


Val Ara Pro 


His 


His 


945 










950 






955 






960 


Lys 


Gin 


Gin 


Phe 


Glu 


Glu Val Cys 


lie 


fl n 


Leu 


Thr Gly Arg 


Glv 


Cvs 








965 






970 






975 




Gly 


Tyr 


Arg 


Pro 


Glu 


His Ser lie 


Pro 


Arg 


Arg 


Gin Arg Ala 


Gin 


Pro 




980 






985 






990 






Val 


Leu 


Asp 


Pro 


Thr 


'Gly Arg Thr 


Ala 


Pro 


Asp 


Pro Lys Leu 


Thr 


Val 






995 






1000 






1005 






Ser 


Thr 


Ala 


Ala 


Ala 


Gin Gin Leu 


Asp 


Pro 


Gin- 


Glu His Leu 


Asn 


Gin 




1010 






1015 








1020 






Gly 


Arg 


Pro 


His 


Leu 


Ser Pro Arg 


Pro 


Pro 


Pro 


Thr Gly Asp 


Pro 


Gly 


102! 








1030 






1035 




1040 


Ser 


Gin 


Pro 


Gin 


Trp 


Gly Ser Gly Val 


Pro 


Arg Ala Gly Lys 


Gin 


Gly 










1045 




1050 




1055 


Gin 


His 


Ala 


Val 


Ser 


Ala Tyr Leu 


Ala 


Asp 


Ala 


Arg Arg Ala 


Leu 


Gly 



1060 1065 1070 

Ser Ala Gly Cys Ser Gin Leu Leu Ala Ala Leu Thr Aia Tyr Lys Gin 

1075 1080 1085 

Asp Asp Asp Leu Asp Lys Val Leu Ala Val Leu Ala Ala Leu Thr Thr 

1090 1095 1100 

Ala Lys Pro Glu Asp Phe Pro Leu Leu His Arg Phe Ser Met Phe Val 
110.5 1110 1115 1120 

Arg Pro His His Lys Gin Arg Phe Ser Gin Thr Cys Thr- Asp Leu Thr 

1125 1130 1135 

Gly Arg Pro Tyr Pro Gly Met Glu Pro Pro Gly Pro Gin Glu Glu Arg 

1140 1145 H50 

Leu Ala Val Pro Pro Val Leu Thr His Arg Ala Pro Gin Pro Gly Pro 

1155 1160 H65 

Ser Arg Ser Glu Lys Thr Gly Lys Thr Gin Ser Lys He Ser Ser Phe 

1170 1175 1180 

Leu Arg Gin Arg Pro Ala Gly Thr Val Gly Ala- Gly Gly Glu Asp Ala 
1135 H90 1195 1200 

Gly Pro Ser Gin Ser Ser Gly Pro Pro His Gly Pro Ala Ala Ser Glu 
1205 1210 1215 

Trp Gly Leu 



<210> SEQ ID NO: 3 

<211> LENGTH: 114793 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapien 

<400> SEQ ID NO: 3 

tgaagagctt tgaccaagag gctgtgacga ggccctacga ggactctggc tctcctcctg 60 

ctaagcacac ccaggcaggt gtcctggcag atgaggacca catgcagagc ctcggecagc 120 

ccaccaatgc ccggatatgc aagtgagccc agcctggacc ccccggcgag gcccagcagc 

accagcccag gcccgaaaac cttaagaaat gaccagtgtc tgctgcttta agccaccaag 

ctctgcggtg gtttgttagg ctgcaagcat ggctaattca gaaactgcca gaaacaagca 

ctgctgtccc cagcctggga cacacagcac cgcctctgcg tggggagagg gcacaggcta . 

agggcacaaa tg.ccatccca gacccggctc ttgtgtgtgg aaggggccac tgtgccatga 

ggcagaggaa accttggcag gaccttatgc cacagcaatt taaaagagaa gaaacaggct 

gggcgtggtg gctcatgcct ataatcccag cactttggga ggccaaggtg gtggatcact 540 

tgaggtcagg agttcaagac cagcctggcc aatatggtga. aaccctgtct ctacgaaaaa 

tacaaaattt aggcaggcgt ggtggcgggt gcctgtaa-tc cctgctattc aggaggctga 

ggcaagagat ttacttgaac ccaggaggtg gaggctgctg cagtgagctg agatcatgcc 720 

actgcactcc agcctgtgtg acggagtgag acttggtctc aaaaaaaaaa aaggaaacac 780 



180 
240 
300 
360 
420 
480 



600 
660 
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atctgactag "tgtgatctcg caaggaacat tccagacaca gtggagctag aaggttcttc 840 

tccaaacaag gaatccccag gggatcaaat tgttttgcat cggccagaca tggtggctca 900 

agcctgtaac cccagtgctt cgggaggctg aggtgggagg actgcctgag tccaggagtt 960 

caagactagc ttgggcaaca cagtgagagc ccattagcca ggcgtggtgg cacatgcctg 1020 

cagtcccagc actgtactaa aaatctacac ggggccgggc atggtggcac atgcctgtag 1080 

agtcccagcC actcaggagg ctgaggcagg acgattcctt gaacccagga ggtcacggct 1140 

gccatgagcc g.tgactgtgc cactgcactc cagtctgtgc aacagaacga gactctgttt 1200 

cgaaaaacaa aaaatcattt catgtctcca gtttctccac tggcaaaaga ctctgtcaag 12 60 

gtaaaaaatg gttctgaccc acagaaatct aagaaaggaa aaaatataaa aaatagaaaa , 1320 

tttaaaaaag agatggtctc agaataaaga ccaacctggg ctatggttgt cactcttccc 1380 

tcacacctta gaaagctttc tggccgcatc tggccaaagg gccaccctgc cccatcttgg 1440 

atcagtgagg tgccttcgaa caagccacct gccctggagc ccgtcctgcc ttgtctgcca 1500 

ccgcacgctc agtaggggag' gggaagtcgc taggttttag ttcaccagtc tctggatcaa 1560 

gacgtgccat aaccaagaag ccccagccac acccagaccc gatgtggcca caaggggtga 1620 

gctgggaagg cccaggaaaa ggcgggaggc ggacgaatgg aaatgtcatt ctgtggccac 1680 

agaaatgatc tcaacgtttt gcaacttcct accaagaggc agtcttagcc ctgcccttga. 1740 

accagcactt ggtgatgtcg cttgcgtcaa tcaaggcaac agaagtgagc aggaggccca 1800 

ctttcctctg caactgtggg cttacggggc aaagaagtcc aggcccccag gtggaggatc 1860 

acagaccggg caaagcagag gagagccacc cagccgagcc tacctgtgcc tcagactgcc 1920 

tccctccaga gacccctgtg gccaaggcca cccagaccag caggtccttg ccaagctgtc 1980 

agctgacgac aggggttggt gaggccggcc cagaccagca gaaccacgaa ccaaccaaca 2040 

gaattaaaaa taataacaac tatgtcttgt ctcaagccac taagttctgg atggtttctt 2100 

tcttcctttt tctttttctt tttcggagac gcagtctcac tctgttgccc aggctggagt 2160 

gcagtggcgc aatcttggct cactgcaagc tctgcccccc ggattcacgc cattcccctg 2220 

cctcagcctc ctgagtaact gggactacag gtgcctgcca ttgggtgttt tcttaaacag 2280 

caaaagaaaa ctgacacaat cataaacaga gcaagcaaga gaactcggca attatttcct 2340 

ctctacttct cactgttctt caaagagtta actcaagcat aagatgtgag caaattcttt 2400 

taacatccta gaaaaaaagc tcctactcag tgttcacaaa gcaaagctaa cctacaggag 2460 

ccaccttcca cagtgaccac aggaaaccaa gacagcaagt gggacaccag cctccagggc 2520 

actgcgccag ccgtgcgcct gtgtctgcca ctgccctggt ccgtcactgc caccagccgg 2580 

caagacaccc acagaggaga gctctaagcc acaactgtgt acgaagacaa ctgtgcagga- 2640 

ttttattact acaacacttt tgttttcttt- tttttttttt tttgagactg agtctcgctc 2700 

tgtcacccag gctggagtgc agtggcacaa tctcggctca ctgtaacctc catctccctg 2760 

gttcaagcaa ttctcctgct gcagcctccc aactggatta caggcgcccg ccaccacgcc 2820 

tggctaattt ttgtactttt agtagagatg gggtttcacc atgttggcca gactggtctc 2880 

aaattcctga caagtgatcc acccaccctg gcctcccaaa gtgctgggat tacaggtgtg 2940 

agccaccgcg cctggcccat ttttgtttat caataaaaac gtacttaatg ttgaactccc 3000 

cacatttcaa atgggtaact ccagtgtcct tgatgctcct gcgacatgct cgtgagactt 3060 

ctcttgggtg tgagagtcca gcatgtgggt ggtctggaca ggagggggag ggaagagtgc .3120 

agagccgggc agggtaaaga gaccccccag gatgtgaagg ccgccctgca tttgtcagac . 3180 

tgggcaacac ccactccatc agatggaccc tggtatgggc ggcaagccac ctaggtgccg, 3240 

aggcaagaga ccgagggcac gagctgttcc ggtgtaataa aatgcataaa ataagaatag 3300 

ttatactaga tatagatcat aaatatgatt atatatgaat atcattcatc attagtttgt 3360 

agcaattact ctttattcca atattataat aatccttgcc taagcataac ctaggaaaaa 3420 

ctaggaaatc ataacctagg aaaaactagg ccatacagag ataggagctg aggggacata 3480 

gtgagaactg accagaagac aagagtgcga gccttctgtt atgcctggac agggccacca 3540 

gagggctcct tggtctagcg gtaacgccag cacctgggaa gacgcccgtt gccaagtgga 3600 

ccgtggtcta gcggtagcct cagtgtcaag gaaaaacacc cgctacttag caaaccagga 3660 

aagagagtct ccctttcccc gggggagttt agagaagact ctactcctcc acctctcgcg 3720 

'gagggcctga catcagtcag gcccgcccgc agttatccgg aggcctaacc gtctccctgt 3780 

gatgctgtgc ttcagtggtc acgctcctag tccgccttca tgttccatcc tgtgcacctg 3840 

gctctgcctt ctagatagca gcagcaaatt agtgaaagta ctgaaagtct ctgataagca 3900 

gaaataatgg cgtaagcggt ctctctctct ctctcctctc tctctgcctc agctgccagg '3960' 

aagggaaggg ccccctggcc agtgggcacg tgacccacat gaccttacct atcactggac • 4020 

atggttcaca ctccttaccc tgccgct.ttg tcttgtatcc aataaatagc gcaacctggc 4080 

attcggggcc gctaccagtc tccgcgtctt ggtggtagtg gtcccccagg cccagctgtc 4140 

tctttctttt atctttgtct tgtgtcttta tttctacact ctctcatctc cgcatacgag 4200 

gagaaaaccc accaaccctg tggggctggt ccctacaccc tggctttgta gactggagcc 4260 

taggcacgac tcagctgctg tagtgaattg cgatcctcca aacccagcaa ggcacctgca- 4320 

ggacatctgg cccagtctcc tcgttgagcc. agttcacgaa aaagagactt ttctgagtga 4380 

catgctaatg ggcaatatga ggactaaatg ggatggtctc caactcggac aaaccaacag 4440 
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taaaagccac tttgcgggga aagaaacttt tccttttttc ttttttttga gacaggatct 4500 
caccctgtca cccaggctgc agtgcagtgg catgaccttg gctcactgca gcctcaacct ' 4560 

ctcccaggct caagcaatcc ccccgcctca acctcccatg cagctgggac cataggtgca 4620 

tgccaccaca cccaaataat ttttaeattt tttgtagaga cgaggtntca ctatgtcgct 4680 

cgggctggtc tcaactcctg ggctcaagca accctcccac ctcagcctcc caaagcgctc 4740 

agactacagg caggagccac caggcctggc caacatagga agaaatttaa acttgaactg 4300 

aatattagaa gagatgaaaa ttcatcaaca tggaaagaca aagatcatta actaaagcca 48 60 

aaccagaatg gaagctgtgt gtacagtggg gtctcatgct gggaacgcga ggggcacgtg 4920 

cagggctcca cggtgtggcg acgccccatg ctccctttgt gggggttcat ccagcggaac 4980 

atgaggacct ggggtgcttc tcaacacgta cgtgagttta ataataaaaa ggtttaagga 5040 

aagaaaaatt catatgtttc tatataaaca gaacatctgg aaagatctat tctaaggtgt 5100 

tgacagtagg aatctctagg tagtagtaat atggcctttt tgaatttttg cttatcagta 51-60 

ttttctaatt ttctttttct ttctaaataa ttctagctat gaaacaattt tctaccatat 5220 

atattttgta ataaaaatgg ttatatttaa ttttttaaag gctgtacaaa cttcctgata 5280 ' 

aaatggcaaa ttagacacac acatgtgggc cgggtacagt ggctcgcgcc tgtaattcca 5340 

gcactttggg aggctgaggc aggcagatca cctaaggtca ggagtttgag accagcctgg 5400 

ccaacatggt gaaaccccgt ctctactaaa tatacaaaaa tgagctggat gtggtggcac 5460 

'acacctatag tgccagctac ttgggaagct gaggcaggaa aattgcttca acccgggagg 5520 

cagaggttgt agtgagccga gatcatgcca ctgcactcca gcctaggcaa caagagcgag 5580 

actccaactc aaaaaaaaat aaaaataaca cacacgtgaa taggctcctc atggaagtca 5640 

tcacaacaat gcagagggaa gagcttccaa agtgtaaacc cagaagcgag gagcaggagg 5700 

gtgcgcgcag acgcagagag cagcaaggtg cagactgaga ggcggaggct ggccgtgggg 57 60 

agatgactga tgctcagttt ataccccaaa tccgt'aaatc tagaggcctg gcacatcaac 5820 

tacctctgcc agcaggaatg agggaaagga gggcaaccaa aagatgtccc accctcaccc 5880 

atccagctac ccgccatcct cagccccact ggcagaagac cccgagaggt ggaggcaggc 5940 

ccctgcctac aggaccctga gagctagggg aaggcgttat cccgaactgt gtcccccgta 6000 

aaattcatat gttgaaggcc ccatccccag tgtgactgta tttaaagacg gggtcttcag 6060 

gagataattt aaatgaggtc atataagttg gccctcatcc agtaagactt tgacctztctg 6120 

gtggtttttt tttttttgga gactgggtct cactctatca ctcaggttgg agtacagtgg 6180 

cacgatcacg gctcactgct gtctccaact cctgggctca ggtgatcctc ctgcttcagc 6240 

ctcctgagta gctgggacta caggtgctta ccaccgcacc cagctggtgg tgcactgtgt 6300 

tttttgtaga gatggggttt tgccatgtcg cccaggctgg tcctgaactg ggctcaagtg 6360 

atctgtctcc ctcggcctcc tgcagtgctg gaattacagg tatgagccac cgcgcctggc 6420 

cgaccgtgac cttctaagaa gtgaaagaga aagatcttcc tctccccctc cctctccatc 6480 

atgaggacac agcaagaagt cggccatctg caaggtagaa agcgagtcct cccaacagct 6540 

gaacctggca gaccctgatc ttggacttca gccttcagag ctgtaagaaa ataactctct 6600 

gctgttcagg ccacgcggtc tacggcagcc cgagcagact aagacacacg ccatctgggg 6660 

agtcagacca gatcaggaag aaaggcctag agctcaggat actgaaggtc ccaacccggt 6720 

gctggaccag accaccccgg cagccgcggc . cacggagtca cggctcgggt gaggtgacct 6780 

ggacaccatc ccggcagccg cggccacgga gtcacggctc gggtgaggtg acctggacac 6840 

caccccggca gccgcggcca cggtgtcacg gctcggatga gatgactcgg acaccacccc 6900 

ggcagccgcg gccacggtgt cagggctcag gtgaggagag ttggatatgg gactgggcct 696 0 

accccgaggc tgcttccacc, cagacgcctg ggtgggtgac acgaaagctg ggctcagttg 7020 

ggatcagagc agcctccccc caggtcagaa atgaccctgg gctcctcaca gtagccctag 7080 

ggcaccatga gaaagctzacg tggacttctc tgaccaaggg tcactgctgc cacactactc 7140 

attgcaggcc acgtcagggc tcagccgagg agacgtggac accaccccag cagccgcggc 7200 

cacggcgtcc caagggaggg acttgggcac cgcctctctg ggcaagagtg gggaggtgtg 7260 

gggtgggaga tgtctggaaa catcatggac acatgccggg aaaacacgga agctgtgcac 7320 

caaggtgctg acaaaggaaa aaggagaatg gaggtgtgaa catccagcta gcaggtccca 7380 

ctcagaaact cccgcatttc cagacatggc caccagctct gtggatgaga caggggagga 7440 

cagggtacct cacaccagga acccacacag gcccatgtct tgctctgtga ccacacaaca 7500 

gcctccacca ccctgacatg caggagggag gtcaaagcct cgggtccaac aacaggctcc 7560 

acagcaaggg aagaaaggca ggaaggaact cagggccagg- tcctcccagg cagcagctgc 7620 

ctgcacgctg tccaccaagg gaggtctgac ctacaccgca caggggttgg cagtctagag 7 680 

tcgtcctctg tcaaacggtg agaaagtcaa aagctcatgc ccagtgatac gctaggtcag 7740 ~ 

catgaagatg ccacacatga gacacagcaa ggatgagac'c aacgggaaga ctgccccaga 7800 

ccagagcccc agagccctct ggggaggaag aacaaggatg gcagcctggg actgcccggg 7860 

gctgactctg cctttatttc accccagcag aggcaggagt gacaccggct cacagcagga 7920 

gcagctctgc cacctcctag cagttccacc cacgggcagc aaaacaaagc tggcagtttg 7980 
ggcaaatgtt agcgtttttg ccaactaaca ttcgaatcgg acatctggta cagagatgag . 8040 

gaagaaaaca ctcacagttt catgaagact gtcaagaaaa tcactgactc ttcacttcat 8100. 
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ttatgaaagg ccagctctct gacatcccta ccactccctc tcacatgaga aatcacggcc 8160 

tttcaggacg tggagccacg tggccatgca ggtacgggag gcctccccgc agctgcagct 8220 

gggtcttctg gtccccgtgc catttctgct tttcttcgct ctctacttac acacacattt 8280 

gagtccagtc tcagaagaac cggaactaga aaaatcctga cacttgtccc ttactacgtt 3340 

aatgccagct gtgccaagga cagcccaacc caagccccca ccagccccaa cggcaccgag 8400 

gcccgagctc acccgtgagg ggccaagttg gtcgtcacca acacggtctc caccccctcc 8460 

acaccactgc cgtccactgc agtgtccgga gttgtcacaa ccaccacctc ctccatgtgc 8520 

acactcacgt cgggagtcgc catggctcag cggaagggga cgcccaggcc agcagcgtca 8580 

gtcctccagg g.tcccaagtc ctggaggaag caaggcaggg cacagggatg gagtcatctc 8640 

cacatccaca caacatagca ctcacaaagg catctctaat cagctccaaa gacccaccct 8700 

tgagtcccag actgctacct cctgacaaaa acgagcggca acagaagggc tactccaggc 8760 

tctggttccg agggcggtgt aagcgcactc cacccgtttt tcccactgga taagccgaaa 8820 

cccttgggta. gaaagcacag agccactccc tccacgtggg gctcagagca ggaggacagg 8880 

aggggcctgg aattccaagc aacttccctg gacgcaggct cccggcttgc cagttcctcc 3940 

gcctctcctg gcctgaactc aaagccagcc ccaatccctg aactgagttt caggtgcaga 9000 

aagcactcca agaagtcctc gctggtctgt ggaacgggaa gggaaaccca ctcaagacag 9060 

aaagagagga gggaaacgce ctgggttttt ftgggttttt gggttttttt tgagacggag 9120 

tctcgctctg tcgcccaggc tggaacgcag ■ cggcacgacc tcggctcact gcaagctcca 9180 

cctcctgggt tcaagtgatt ctcctgcctc agcctctcca attgctggga ttacaggttt 9240 

caccatgttg cccaggctgg tctcaaactc ctgacctcag gtgatccact cacctcggcc 9300 

tcccaaagtg ctgggattgc aggtgtgagg caccatgcct ggcctgcccc gggtttaaaa 9360 

attattatta ttttgtcttt cctggctttg ccttcagcaa gtccaacccc tgctaaaacc 9420 

cggtgataat ggctgtcctg gcccaaaaag cttggagaca ggggaatctt cctcctgact 9480, 

aaaggaatgg tggcccaaga gtgtgggggc tccctgttgc cctctcactc tccatcccct 9540 

' acctagcaca gggaacacaa aagcccctgg cttccagcca gagggcaacg agcctggagt 9600 

cagagtgtgg gggaggcgac aagaggagag gggagaagag aggatggcac acagctgtgt 9660 

gtgagcgcct gggtcgtccc aagacagtct c.tacgtggtc ctgaccctaa agggcaaagg 9720 

' gaagaaaact gacctacagg ataggccact gcccaggtct cagatgggcc ccagtggcgc 9780 

atatgggaca gatccacagt gcactggaaa gtctctaaaa taaactggcc taagaacaca 9840 

gacacaggaa cggggtgcaa aatttgcagc ctgaacctaa ccaggtcgaC ttcttgctat 9900 

gaaaaaaaaa agtctacatt ctctgtgaaa ctcaaaacaa gacctagagt ccatagcaca 9960 

gtagtcaaag catccagaac acgatcaaac ttcctggcaa agggtagtct ggttgattct 10020 

caaaggaaca aatacacaag agaagctggc Ccttgaacgc agaatccaga gactttcagg 10080 

tgctatcgga ccagctccaa gaggaaagca aacattgtca accaagtgga- aagaaaatct 10140 

tggcatagaa acaggagtta taaccaaaca gaaatgtgaa aattaaaaac gacaaccaaa 10200 

agaaaataca caaagctggg atagtctcag ctacteggaa ggcggggctg gaggatcgtt 102 60 

tgagcctagg agattgaggc tgcaatgagc tgtgatcaca ccaccgcact ccag.tctggg -10320 

caacagagtg agaactctct caaaaaacga aaaagaaaga aagtagaaca gaagtgacca 10380 

ggggctgggg gagggagtac agggagttgt ' tctttaacga gtacagaatt. tctgtttggg 10440 

atgatgaaaa gctctggaaa tggacggcgg tgatggctgc acaatcactg tggctgttct 10500 

gaatggtgct gaaccacaca tttaaaaaca gttaaaatgg gctgggcgtg gtggctcacg 10560 

cctgtaatcc cagcactttg ggaggcggat cgcctgaggt caggagttcg agaccatcct 10620 

ggccaacaca gtgaaatcct gtcttgacta aaaatactaa aaattagcca ggcatggtgg 10680 

caggcacctg tagtcccagc tacttgggag gctggggcag gagacctgct tgaacccagg 10740 

aggcagaggt tgcagtgagc cgagatcgtg ccactgcact ccagcctggg caacaagagc 10800 

gaaactccat ctcaaaaaaa aaaaaaaaaa aaaaaaaaaa aagtttaaaa tggttaaatt 10860 

ttatgttatg tatattttac cgtaataaaa acactgtaat gctactataa tagaatgact 10920 

cattaggatt agatatagac tagaaagtac agaatataaa aactttttaa acaaagaaaa 10980 

attttcatgg ccaggcacgg tgtcacacct gtaatcccag gactttggga ggccaaggca 11040 

agaggaatgc ttgagctcag gggtttgaga ccagcctggg caacacagca acaccccatc 11100 

tctgctaaat aaataataaa_ aaatagccag gcatggtggt gtgcacgcct gtagttgcag 11160 

ctactctgga ggccgaggca' ggaggatcac ttaagcccag gaggtcaagg ctgcagtgag 11220 

ccatggttgt gccactgcgc cccagcctgg gcaacagatc aagaccttgt cacaaaaaaa 11280 

agaaagaaag aaaagaaaaa agaaagaaaa taaaatcttc cagaactttt aaaatcatca 11340 

tegttaatat aaaaataaca ccacctgccc ctaggactgt aacaaacaag tgtgtctaag 11400 

gacaggagtg ggtccacccc aacctggcac gcagtggtcc cctgcggaga gtctggccct 11460 

gcactcacta agaggaggca ctcatagccc agccaggcct ctgcaattat gccttcaatg 11520 

ccagaactaa ctcacccaaa ctgaacaatc gatcacaaaa tgtgccttca ggtctcaagg 11580 

ttcttgctaa atcttactca accgacattt tccagcatgg gaacattttt ctgaatgtct 11640 

tagggagagg aagtccgcaa. gagaacaaaa ggtcctcagg ccaccctagc ttcttttcct 11700 

ccattccaca ggctgtcttt tgtctgggta tgcactggac cagggggctc tacttcttcc 11760 
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tacctgggca tgggtctcca cacaactcca aggtaaaggg ccacaggcaa gataaagggg 11820 

agaaaagaaa gctacgattt cctgggccac caatcgcaaa tggcagccag tctctgaagt 11880 

aacccttgac cagagatcca aggaaccaag aaatgtaggt gatctgaaca gaggggatgg 11940 

tggttaaaca ccatgaagga aagacccatt cccaaagaaa aggaagcaaa aagaaaccgt 12000 

ggggagctgg gtaccacccg cagcaaagac cccgcacgcg ttactgacgc cagcctggcc. 12060 

tgggagagca g.tgagtgtgg cggacggtga gtggcgggga gggctgtggt aggtttaggg 12120 

taagaagggg cagcgcccag agcccagaga acaccagtga gggctccaca ggaacactac 12180 

tcaaagtatt cacggaacac acctaaacac aagcactaag gactaagtgc gagggacaag 12240 

aaaatattcc ccgtttcctg tttcaggagg gtatcgaaaa cgagtgatgg aaggaaaatg 12300 

tattgtttaa atgaggaaaa aaaattttta caaattaaga acatcctgga acatgatgag 12360 

ccgtttactg tcactcaatt taaatggtgg ccatctagga cagagcgcct aaggggaaag 12420 

ggggctcaca ggtgaacccc tccagctgct ggtgggcaat ttcceattag ggcatcaggg 12480 

tctctgaaga ctgtcttcag atgcttttta gccaggaaag ttacaatgat gaattcgttt 12540 

acactggcgg aattacttcg tatttctcaa atataatgtt ttcactagca taactttgtt 12600 

gttgtagact taggcttcaa aataaagaac tttaaacaaa catgaataaa aagccacttt 12660 

aggccgggcg cggtggctca cacttgtaat cccagcactt tgggaggccg cggcgggtgg 12720 

atcataaggt cagaagttca aagaccagcc tgatcaatac ggtgaaaccc cgtc.tctact 127 80 

aaaaatacaa aaattagccg ggcgcggtgg caggtgcctg taatctcagc tacttgggag 12840 

gctgaggcag g.agaatcgct tgaacctggg cagcagaggt tgcagtgagc caagatcatg 12900 

ccactgcact caagcctggg tgacagagtg agactctctc ttaaaaaaaa aaagccactt 12960 

taaaatttta ctcaggccag gtgtggtggc tcacgcccat aatcctagca ctttgggagg 13020 

ccgaggcgag cagatcacct gaggtcagga gttagaccag cctggccaac atggtaaaac 13080 

cttgtctcta ctgaaaacac aaaaattagc tgggcgtggt ggtgtgccca tgtaatccca 13140 

gctactcagg aggccgaagt gagagaactg cttgaacccg ggaggcagag gctgcagtgt 13200 

gccaagactg caccactaca cttcagcctg ggcgacagag caagaccctg tctcagaaaa 132 60 

aaaaaaaatt caaaaatttg gccaggcgtg gtggctcacg cctgtaatcc catcactttg 13320 

gaaggccgag gcgggtggat cacctgaggt caggaattca agaccagcct ggccaccatg 13380 

atgaaaccct gtctctacta aaaatacaaa aaaaaaaaaa caaattggcc gggcatggtg 13440 

gcgggtgcct gtaatcccac ctacttggga ggctgaggca ggagaatctc tcgaactccg 13500 

gaggcagagg ttgcagcgag ccaagattgt gccactgcac tccagcctag acaacagagc 13560 

gagactctgt ctcaaaaaaa aaaaaattaa aattaaaaaa taaaaatttc atttaaaata 13620 

ctactgatct cccgcgctga cttctcgggg tttaactctc actgaggaga cgctgctttc 13630 

ataagggtaa gctcagcagg ggcaactaaa gtcatttaag cagagagctg caaagaggca 13740 

acagcctcac tgcaggcagg ggtcctcgtc acagcttcag ggctttgcag aggattacgc 13800 

aatgtacacg cacaaaactg aattccagcc tctccattgg caactgcata catacatata 13860 

ttcttttttt gagacggagt ctcgctctgt agcccaggtt ggactgcagt ggcccgacct 13920 

cggctcaatg caagctctgc ctcccgggct caagcgattc tcctgcctca gcctcctgag 13980 

tagctgggat tacaggcgcc caccaccacg cccggctaat ttttgtattt ttagtagaga 14040 

cggggtttca ccatgttggc caggacagtc tcgatctcct gacctcgtga tccgcccgcc 14100 

tctgcctccc aaagtgctgg gattacaggc gtgagccact gagcccggcc tccaatggca 14160 

actatattaa aggttcaaag caatatgcac aaaagttacc tcacagaaaa tagtgcaagt 14220 

ccttgataca atgctcttta gacacagaag aagcactata gaatagagca cctcgcccta 14280 

ttgccttccc aagggcgagc accccctcct ' ctctccacag ctccttcttt gtttttttga 14340 

gatggagtct cgctctgtca cccaggctgg agtgcaatgg caaaatcttg gctcactgca 14400 

acctccgcct cccgggttga agtgattctc ctgcctcagc ctcccgagta gctgggacta 14460 

caggcaccca acacgcctag ctaatttttg catttttggt agagacgggg tttcatcatg 14520 

ttggccaggc tggtctcgaa ctcctgacct ccagtgatcc tcccaccttg acctcccata 14580 

gtgctgggat tataggtgtg agccactaca cctggcctct ccacagcccc ttctgtgttg 14640 

aagccaagac ccacccagct ttgatcccaa ggctcgggtt ccccactagt gtgaagtgag 14700 

tttccaaatt attaggtaaa tcagatatga gaaaatattt tattttactt tttttttttt 14760 

gagacgcaat cttgctccgt cacccaggct ggagtgcaat ggcaccatct ccactcactg 14820 

caacctctgc cttctgggtt caagcaattc tcctgcctca gcctcccaac tagctgggat 14880 

tacaagtgca caccaccacg cccggctaac ttttgtattt ttagtagaga cagggtttca 14940 

ccgtgttagc caggctgctc tcaaactcct gacctcatga tccgcccacg tcgggctccc 15000 

aaagtggtgg gattacaggt gtgagccatc acacctggcc caagaaaata tttttaaact 15060 

agtattcttg accggcacgg tcaacactga tgtaattgaa actgttgtat ttgaagtgtt 15120 

agcaaagaaa gagaattctg gttcaacaga aaagtcagtc acgacttttc agtcacgcat 15180 

gaattacaca gtaaccaaat agataacatg ccatgactga cgacgggccc acaacaaatc 15240 

agctccgacc aacagggtcc acaccaccat gggtctacac agatccaggt cccgcctgtg 15300 
agcctacag.t gacgcgggcc cctgtggggt ggtccctgca ggtcaggtcc ctgagagtgg . 153 60 

gtcccagtgg ggtgatccct, gcgggtcgcg tccctgcgag ttgggtgcct gccgggtggc 15420 
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ccctgcgggt cgggtgcctg cggggtggtc cctatgggtc gcgtccctgc gggtcgggtg 15480 

cctgcggggt ggcccctggg aatcgegtcc ctgcgggtcg ggtgcctgcg gggtggcccc 15540 

tggggatcgc gtccctgcgg gccgggtgcc tgcggggtgg cccctgggga tcgcgtccct 15600 

gcgggtcggg tgcctgcggg- gtggtccttg tgggtcgcgt ccctgtgggg tggtccctgt 15660 

gggtcgcgtc cctgtggggt ggcccctgcg ggtcgcgtgg tggcccctgc gggtcgggtg 15720 

cctgcggggt ggtccctgtg ggtcgcgtcc ctgcgggtcg ggtgcctgcg gggtggtccc 15780 

tgcgggtcgc acccctgcgg cgtggtcccc ccgggatggg tccaccgagg aggccgctgg 15840 

aggccgagcc cgcgcccgcc cgcggcgcca agatggaggc aggaagcgcc gccgcccgcg 15900 

cccgccaccg cccgcgccgc ccgcctgacg ccgccgttgc gcctgacgcc gccgcccgcg 15960 

cggccgcccc tcccccggcc ctcccctccc cccgccgtaa cgtcctgacg ctccgcaggg 16020 

acccctgact ggacggcggc gcgtgagcgg agcgagaggc ctcgccgcgg gggggccgcg 16080 

ggctcgccgg cgccgcttac ctggggccgc gccgggcctg cttaggcacc cggcgggggc 16140 

ggcggcgtcg ggagctgcgg cggcggcggg cggcggcggc ggccgcgggc ttcgctcctt 16200 

gttggggatt cggcggcggc ggcggcgcgg gcgcgcgctt cctagtgacg caggcggcgg 16260 

ggccgcgcac gcacggggct gggagggccg gacacttatt tggcgctcgc ggaggaggaa 16320 

ggcggggccg tgaaataagg cccgacgggc cccggggcgc gtgcgcggac cgacactgtc 16380 

agcccctaac gccgcaggtt cctcctggtc cccgaggccc ccggtcgggc gttgcctgcc 16440 

ccgcgcgggc ggccgggccg agggacgatg gtcagtggac ggacggcgcc agggagcagt 16500 

gcccacgcgc ggcagggcgg taccttcagg cctccagg.ta cgggcgctcc tcgcccggac 16560 

gctgctgtgt gtgaatgggc gcgaggggac tcccctgcgg ggcggacgcc tgaacacgag 16620 

gctgtggagg aggacgctgt agggcgcgcg gactcacgcg gaacatgcca gaggctcagc- 16680 

cagccacggc gctcccagcg tggagggcga ggggcatccg ggagcggccg ggagggctcg 16740 

gtcacccctc aagctgtcac cccagtccca caaccagcac cccgatccta tcgcagtccc 16800 

acagccgaca ccccgatccc acccctgccc aacagccggc acccacccca atcccatagc 16860 

taacaccccg gtcccaccgc tgtcccacgg ccggcacccc gatcccaccc cagtcccgca 1692 0 

gctggcaccc cgatcccacc ccagcc.caac agctggcacc caccccgatc ccaccgctgt 16980 

cccacagccg gcaccccgat cccaccccag tcccgcagcc ggcaccccga, tcccacagcc 17040 

ggcactcacc ccgatcgcat agcatagctg ataccccgat cccaccccag tcccatagcc 17100 

agcaccccga ccccacccca gtcccatagc cagcacctcg atcccataga tgacaccccg .17160 

atcacgcccc agtcctatag cccgcacccc gatcccaccc gagtcccgca gccggcaccc 17220 

catcccaccc atgtcccaca gtcggcaccc cgatcccact cggatccggc agccagcttg 17280 

gatcctgtgg ccctcctcca gcccccaggg ctcatttata tgttttattg gcagaggctg 17340 

gggctggctc tgttggcctc tgtgctgggt ttcttcctct gcaccgcagg actggctctc 17400 

ctgacctctc caggtgtcat cgaacaccct tgtgcttgct gtcacccgct gcctgtctgc 17460 

aggatcccgg attccgtatc aggggaccga aattagtcgg. aaaataggaa gcaggtgctc 17520 

gcttggatgg aaccctgacc" ctgtgctcac acttgtagga ggagggctct gcaggccgcc 17580 

tcccggaacg ggaggttccc aagccactgc acttcggagg ggctgtaatt agagttgcac 17640 

attcattcag ttcccagtaa agtagaacgt gctccagcca gtgaggaaaa ggtgttttta 17700 

aaaattagat tggccgagtg cggtggctca tgccttttac ctcaacactt tgggagacaa 17760 

aggtgggagg atcacctgtg gccaggagtt caagaccagc ctgggcaaca gagcctgtct 17320 

ctggggaaga ataaaaaaaa aaattgagcc tttgtcagtg ctactatttt attatctggt 17880 

aaatatgaga gggttcacgc ggtctatgtg tgtcatttat ctgagtttgc ctatcgtcac 17940 

gttttggaaa taaatgtcaa taaagtcgaa gaggagtgct gaggggggcc tggggatggg 18000 

agggtggcta catcatgcct gtgtgttgcg caagcccacc gaggtcggcc tggggtgagc 18060 

cctggggcct gttctgcctc cttcactctg gggctccaag agacaaactg ggcaacaaga 18120 

gagaaactcc atctaaaaaa aaagaaaaat cacctccaag ataacttagc tttcttctgc 18180 

tggcataaca aattatctca aacttagtcg cttaaaaatg. caaatttagg ctgagtgcgg 18240 

aggctcacgc ccataatcct agcactttgg gaggccaagg caggattgct tgaggccagg 18300 

agttcgagac caacatggcc agaactgtct ctttttaaaa aatgcaaatg. tgtccggcac 18360 

ggtggctcac gcctataatc ccagcacttt gtgaggccaa ggcgggcaga tcacgaggtc 18420 

aggagataga gaccatcctg gctaacactg tgaaaccccc tctctactaa aaatacaaaa 18480 

aattagcctg gcgtggtggc aggcgcctgt agtcccagct actcgggagg ctgaggcagg 18540 

agaatggcgt gaacccagga agcggagctt gcagtgagcc gagatggcgc cactgcactc 18600 

cagcctaggc aacagagcaa gactccgtct caaaaaataa ataaataaaa ctgcaaatgt 18660 

attctctaac tgttctgtag gtcggaagtc cagcccagcc tcactccgcc aaaatcaggg 18720 

tgtctgcagg gccgattgct tttggagctc caggggagaa gctgttctgg cctttccagf 18780 

ttctggaagc acttgagccc cttgtctcgt ggcctatccc acacctgaaa gccagccaaa 18840 

gccagttgag tcctcaccct gttggccccg acactgatct cctgcctccc tcatctgctg 18900 

tcaaggcccc ttgtgatgac atggggccac cagctggccc agggcacctc ctgtcagagt 18960 

ccgccgacca gtgaccttca ttccatctgt cgctgtaatt cccctttgct tggaaccaac 19020 

gttcacagat cccaggggtt. aggatgtgaa tatcttgggc agggctgtgg gggggctatt. 19080 
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cttccttcta aaatatttat catttttgtt ttggggattt ttttggtttg gtttt.tttcg 19140 

agacagagtc tcgctctgtc gcccaggttg gagtgcaatg gtgcaatctc agctcactgc 19200 

aacctctgcc tccgggcaga cgtgagccac tgcaccaggc ctgtttttgt ttttgtttgc 19260 

tttgttttgt ttttgagatg gagtcccggc cgggcgcggt. ggctcacgcc tgtaatccca 19320 

gcacttcggg aggccgaggc gggcggatca cgaggtcagg agatcgagac catcctggct 19380 

aacacggtga .aaccccgtct ctactaaaaa tacaaaaaat tagccgggcg tggtagcggg 19440 

cgcctgtagt cccagctact cgggaggctg aggcaggaga atggcgtgaa cccgggaggc 19500 

ggagcttgca gtgagccgag atcgcgccac tgcactccag cctgggcgac agagcgagac 19560 

tccgtctcaa aaaaaaaaaa aaaaaaaaaa aaaaaaagag atggagtctc actttgtcac 19620 

ccaggctgga- gtgtagtggc gggattatag gtacgcgcca tcatgcccag ttact.ttttg 19680 

tatttttagt agagacaggg ttttaccatg ttggtcagac. tggtctcaaa ctcctgatct 19740 

caggtaatcc acccgcctca gcctcccaaa gtgctgggat tacagacgtg- agccaccgtg 19800 

tctggccata tttattaact acaaagggaa agatgataat tttttttttt gagatggagt 19860 

ctcactctgt cacccaggct ggagtacaat agcgtgatct tggctcactg aaacctctgc 19920 

ctcccaggtt caagcgattc tcctgcctca gcctcccaac tagctgggat tacaggcgca 19980 

cgctaccaag cccagctaat ttttgtattt ttagtagaaa cggagtttca ccatgttggt 20040 

gaggctggtc tcgaactcct gaccttgtga tctgcccacc tcggcctccc aaagtgctgg 20100 

gattataggc atgagccact gcaaccggct gaaagatggt aattttaaag tagagaaact 20160 

gggttggctg ggcatggtgg cttatgcctg taagctcagc actttggaag tccaaggcaa 20220 

gaggatcgct tgagtccagg agtttgagac cagcctggac aatatagcaa' gaccccatct 20280 

ccgcaaaagc taaaaagtta gccaggtgtg gcggcacatg cctgtagtcc cagctactca 20340 

ggaggctgac gtgggaggat cacttgagac caggaggtca aggccgaagt gagctgttat 20400 

tgtgccactg cactcagcct gggcaacaga gcgagagtcc gtctccaaag gtaaaaaaag 20460 

gtccaggcac agt-ggctcac acctgtaatc tcagcacttt gggaggccga ggcgggcaga 20520 

ttcgttgagg tcaggagttc aaaacgagcc tggctaaatg gtgaaacccc gtctctacta 20580 

aaaatacaaa aaaattagcc aggcatggtg acgggcgcct gtaatctcag ctacttggga 20640 

gactgaggca ggagaatcat gtaaacccag gaggctgagg ttgcagcgag ccaagatcat 20700 

gccac.tgcac ttcagcctgg gcgacagagc aagactgtct caaaacaaaa caaaagaatc 20760 

ttgagtcctg agttcctcta agggaaattc caggcacctc gccacccttg acaggcaaag 20820 

gaacaatctg atgaggaaga agatagaaac agcttaaaca atagtctccc ggccgggggc 20880 

agtggctcac gcctgtaatc tgagcacttt gggaggccga ggcgggtgga tcacaaggtc 20940 

aagagatcaa gaccatcctg gctaacatgg tgaaaccccg tctctactaa aaatacaaaa 21000 

aattagccgg gcgtggtggt gggtgcctgt agtcccagct actcgggagg ctgaggcagg 21060 

agaatggcgt gaacccagga ggcggagctt tcagtgagct gagatcgcgc ctctgcactc 21120 

cagcctgggc gacagagcct cgagactcca tctcaaaaaa aaaaaaaaat tagctgggtg 21180 

tggtggctca cacctgtaat cccagctacg Cggcaggctg aggcaggaga atcgcttgaa 21240 

cctgggaggc ggaggttgta gggagctgag atcgcaccac tgcactccag cctgggcaac 21300 

agagcgagac tctgtctcaa aaaaaaaaaa aaaaaacaaa aaaacaatag tctcccaagt 21360 

aagtcagagt cacaaggtgt tttgattccc tgtggaaact aaaatataac agcttaacat 2142 0 

atgttcttga gttatttttc agaaacttgg acacccacca ggtggaaaat gctgagctag 21480 

gaacagtggc tataatttca gccttttgag aggccaaggt ggaaggacca cttgaggcca 21540 

ggagttagag accagcctgg ccaacatggt gaaaccccgt ctctagtaaa aatacaaata 21600 

ttagctgggc atggtggtgc aacctgaaat cccagctact tgggagacct agctgggagg 21660 

atcgcttgaa cctggtagga ggagtttgca gtgagctgaa attgtgccac tgcactctag 21720 

cctgggcaac agagtgagac tctgtctcaa aaaataaata aataaaaaga gaaaaaagtg 21780 

ttgcctgcag gccgggcaca gtggctcacg cctgtaatcc caacactttg ggaggccgag 21840 

atgggcagat cacctgaggt caggagtgca agaacagcct ggccaacatg gtgaaacccc 21900 

atctctacta aaaatacaaa agttagctgg gtgtgtacat gtagtctcag ctacttggga 21960 

agctgaggca ggagaatccc ttcaaccggg gaggtggagg ttgcgatgag ctgagatcac 22020 

gccaccacac tccatccagc ctgggtgaca gagtgagact ccatctcaaa gcaaaaaaag 22080 
aaacataggt gggacccttg gtgtgtcctt agggcatgat ggttgaggta tactgctgg.t 22140 
cctgtcatgt aaaagaaaac gagccgactc . tgtgtctact ggagaaagca ctgcatatat 22200 
cagccacagt caatacctcg cttctgcagg gacggtggct gccagagtgg gaggctttgg 22260 
tagcacccat gtcgtggaat cacaatgttg tcgatagctc tggggtcttg tacaaaatgc 22320 
cagatcctcc- catttggttt ccttatggga aggatcgcag tactataata catgggcttg 22380 
tgcaagggat cattataccc ttttctcttt t.tttgctttt ctttgagaca gagtttcact 22440 
ctcgtcaccc aggctggagt gcaatggcgc gatcttggct cactgcaacc tccacctcct 22500 
gggttcaagt gattttcctg gctcagcctt ctgagtagct gggattacac atgcccgcca 22560 
ccaggcctga cttatttttg. tatttttagt agagacaggg tttcaccaag ttggtcaggc 22620 
tggtcttgaa ctcctgacct caggtgatcc acccacctcg gcctcccaaa gtgctgggat 22680 
■ ttcaggcata agccaccagg cccagccttt ctttcttttt- aaaattaatc tttgtttaaa 22740 
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aatactctca ttttttattt aattgtagca ctcctagatc ccgaaagcag atacactctt 22800 

gttatgggtc tgattctttt cattgcttca cgccttagag gatattgtcc aatactggat 22860 

aaaagtttac tcaggtctac ttccacttta. acggggatgg ctgaatatct cttccacttg 22920 

gctgtttgtt tataatgaac tgacaaacat acaaattttc ttgagttctg tgagacattc 22980 

tagtaaatca tctaacctga agagcaggtt gtgagaaccc ctgatttaga aagcccagtg 23040 

gtcataaata taagtggctc tggactggct cccggggtct gaagtgtggg cagtcggtta 23100 

ggattgagcc cttgtaattt gtaggatctg acacacactc caggaaggca gtgtcagaat 23160 

ttacctgtat tatattggac acccagttag cgtttggaga attggttgct ggtatagaaa 23220 

aataccaaat attttatgtc aggggagtga aagaaaaaac aaaaacccgg ccgggcgcgg 23280 

tggctcacgc ctgtcatccc agcactttgg gaggccgaga ■ cgggcggatc acgaggtcag 23340 

gagatcgaga ccatcctggc taacacggtg aaaccccatc. tctactaaaa atacaaaaat 23400 

tagccgggcg tggtggcgcg cgcctgtagt cccagctact cgggaggctg aggcaggaga 23460 

atggcgtgaa cccgggaggc ggagcttgca gtgagcccag atcgcgccac cgcactccag 23520 

cctgggcgac agagcgagac tccgtctcaa aaaaaaaaaa caaaaaaaaa aaacaaaaaa 23580 

aaaaaaccca tacactttaa ggaaagcaac tgacagcatt tgttaccagt gataaaattt 23640 

gagccttgaa gtaagaataa caattttgcc attgtgcccg ggccaagaaa aaaaaaagaa 23700 

ttttgccatt gtgaaaggct tcccagtact. ttctgatgag cttgacggtg atattaacaa 23760 

■ataacttttt tttttttttt ttgagatggg gtcttgctct gtcacccagg ctggagtgca 23820 

gtggttcaat ctcagctcac tgcaacctcc gcctcccagg ttcaagcgat tctcctgcct 23880 

caacgtccca agtcgctgga ctacaggtgt gcgccaccac gtccagataa tttttgtatt 23940 

tttagtagag atggggtttc accacgttgc ccagactggt ctcaaactcg tgacctcagg 24000 

cgacccgccc acctcggcct cccaaaggtg ggaggccttg ctgggattag aggtatgagc 24060 

cgctgcacct ggcctcttgt ccttgtgttt tgcagtgatg caatgaccat gtcttacatt. 24120 

tgcaaccaga aaaaaaggtt agtgtaacaa ' tgtttatcct gtttttccca gagtagacat 24180 

tatgaagatt aaaaaaattt gaaagcgtcc tgaatataat aaactatgct atacacacaa 24240 

cattttggtg actagaaata caagtttatt gtttgttgtt tgttgagaca gggccctgct 24300 

.ctgtctccca ggctgggtgg cacaatcatg gctcactaca gtcttgaact cctgggctta 24360 

agcgatcctc ccaccccagc ctccagagta gccgggactg caaacgagca ccaccacgcc 24420 

tggctaatat ttgtattttt tgtagagatg ggg.tttcacc atgttgccca gactggtctc 24480 

aaactcctgg gctcaagcaa tgctcctgcc tcggcctccc aaagtgctgg gatcacaagt 24540 

atgagccact gcacccggct gagtttctgt tgttttaagc cgcttcattt gtggtacttc 24600 

ttacagcagt cccaggaaac cgagcaactg cagaacacca aaattgtttt tcttcagcaa 24660 

aaggagaagc acttgtggtt ggcaccagct tttcctgtgc tcacttctgc atggccgcac 24720 

ctttgcccga cacgagtgca cagcaggctg tgggggagca actggttgag tcaggcctcc 24780 

acttgtgccg tatccccacc tgctttgctg gacacccctg tttggggggc acccactgct 24840 

gccccagaca ccaagcaagc accagctgtg tccaaaactt acagtcactg tcttggcccg 24900 

ttttgtgctg ctgtaacaga atgccacaga ctgggtaatt taatacagaa cagaaattta 24960 

tttcctcaaa gttttggagg ctgggaagtc caagagcaag gggccatcag gtcagggcct 25020 

ggtctctgct tccacgatgg caccttgacc accgtgtcct cacgtggtca gagagagccc 25080 

actcccagga gcccttttaa tagagcagaa cactgctgcg ctgcggttaa gtttccaaca 25140 

cgtgaacttc ggaggtgaca cattcagacc atagcagtca ctctaggcag agtgtctgat 25200 

gtggttttaa aatacgttca cagactggcc gggcactgta gctcacgtct gtaatcccaa 25260 

cagtttggga ggccaaggtg ggtggatcac ctgaggtcag gagttcaaga ccagcctcac 25320 

caacatggtg aaaccccatc tctactaaaa atacaaaatt agccaggtgg tgcatgcctg 25380 

taatcccagc tactcgggag gccgaggctg gagaatcgct tgaatccagg aggtggaggt 25440 

tacagtgagt cgagatcatg ccattgcact ccagcctggg caacaagagc gaaactctgt 25500 

ctcaaaaaat aaaataaaat aaaatacatt cacaaggccg ggcactgtgg ctcacgcctg 25560 

taatcccagc tacttgggag actgaggcag gagaatcgct tataacctgg gaggtggagg 25620 

ttgcagtgag ctgagatcac accgctacac tctagcttgg gcaacaagag tgaaactccg 25680 

tctcaaaaaa gtaaaataag gccctgcagg catggtggcc cacacctgta atcccagcac 25740 

tttaggaggc caaggcggtc ggatcacgag gtcaggagtt cgagaccagc ctggccaaca 25800 

tgatgaaacc ccgtctctac tagcctagcc aacatgggga aaccctgtct ctactaaaaa 25860 

tacaaaaatt agccgggcat ggtggtgcgt gcctgtaatc ccagctactc aggaggctga 25920 

ggcaggagaa tcgcttgaac ccaggaagca gagggtgcag tgagccaaga ttgcgccgct 25980 

gctctctagc ctgggcgaca - gagcgagact ccatct.ctaa ataaataaat aaaataagaa 26040 

aataaaatat gttcacaaat cctttgacat tcctcacctc aaaagctgga acccaactcc 26100 

ctcctaagca tgagtcttct cagtgactca cttctaacag cagaacttac atggttcccc 26160 

acacccagag gacattgggt tcctcccaat atccccccac ccagcgaccc ccacccaggt 26220 

cgctggcttt gggtccccca gagccatgtt tcaaggacac tcaggcagcc cctggatgtc 26280 

. cacgtgg.taa ggaatgaagg cctcctgcct gcagcctcgg gagggagcat tctcagaaga 26340 

. ggatgcccca cctcctgccc agccttcaga tggccaggac ctcgtccaac gtcctgactg 26400 
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caacatcatg agagactccg agccagaaac ccccaggttt tgtactcctg acttatggga 2 6460 

actgacagat aatgttcgtt gttaatcaag gggtgacttg tcacacacaa taggtcacta 26520 

aacagctctg tctggcctcc caggaggagc ctgcctttcc ttttcttcat g.ggaaaagtg 26580 

cgatcagttt gtgaaggaat gtccgccccc acttgatgcc agaggctcca catggtgact 26640 

gtcataaact ccatctgccc tcagtgcctt gccagcaccc ggcccgc.gat cagcctggcc 26700 

ttgcgggagg ccaaggccca cgtgtgtttg tgtgtggtgt ctgtgtctgc gtgcccatgc 26760^ 

atgcccaggg tacagggatg ccatatacaa attctttcaa tgttgtatgt ggcatgtgtg 26820 

tgtctgtatg cccaggatac agggatgcta tatacaaact ctgttttttc gttttttttt 26880 

ctttgagaca gagtcttgct gtttcgccca ggccggactg cagtggcgct atctcggctc 26940 

actgcaagct ccacctcccg ggttcacgcc atcctcctgc ctcagcctcc tgagtagctg 27000 

gaactacagg cgcccgccac cacacccggc taattttttg tatttttagt agagacgggg 27060 

tttcaccatg ttagccagga tggtcttgat ctcctgacct cgtgatccac ccgcctcagc 27120 

ctcccaaagt gctgggatta caggcatgag ccaccacgcc tggcctacaa accctttctt 27180 

ttttcttttt ttcttttcga gatggagtct cactgtcttc caggctggag tgcagtgatg 27240 

cgatctcagc tcactgcaag ctccacctcc cgggttcatg ccatztctcct gcctcagccc 27300 

cccaagtagc cgggaccaca ggcacacacc accacgccca gctaattctt tgtgttttta 27360 

gcagagatgg ggtttcacca tgttagccag gatggtctcg acctcctgac ctcgtgatcc 27420 

gcccgcctcg- gcctcccaaa gtgctgggat tacaggcgtg agccactgcg cccagcctgc 27480 

aaactctttc aatgtccttc ttttctctct cctgcca.tct tctcccttgc agatttcttc 27540 

tgtctctacg tcttccccag ctgagtccga ggtcctgact cgcccacgct ccctggactg 27600 

gaggagaggt gatagcaaga gctccttcaa gcccaggaat gccaccaggg ctgccccggg 27660 

agaggaggaa gctgggtctc tcggggttgt ggggaccaga cacccttcta agacatggac 27720 

tcagcacaga aagtccagac atccactaca aacacatctc cctcctaaca gggggcccct 27780 

gggcacccca agtggctgtt cggtgggaca ggcatgtcca ccagtcagaa tacctttatt 27840 

tcttattttt tattttttat ttttgagaga gtttcactgg agtgcaacgg cacgatctca 27900 

gctccctaca acctccgcct cccaggttca agcgattctc ctgcctcagc ctgccacgta 27960 

gccgggatta caggtgtgag ccaccacacc cagctaattt ttttcttttt tttttgagat 28020 

ggagtctcga ggctctgtcg cccaggctgg agtgcagagg cgcgatctca gctcactgaa 28080 

agctccgcct cctgggttca cgccattctc ctgcctcagc ctcccgagta gctgggatta 28140 

caggcatgag ccaccg.cgcc cggccaattt tgtattttta gtagagacag ggtttcacca 23200 

tgttggtcag gctggtcttg aactcctgac ctcaggtgat ccacctccct cggcctccca 28260 

aagtgctggg attacaggcc tgagccacca cgcccagccc agaatgtctt cttacttttt 28320 

attactctgt cccccatcct gggtccagac ctgtgaccgt gaacaaccgg ctgcccaggg 28380 

gtgaatgggg tgagtggggt gagtccacag aacagtgggg tgcagcccca ggggtctcgt 2 8440 

agcacctgcc cccaggtcag gaagtcccac agcctagagg ctccagcctc agatgcatac 28500 

atatgtaggc cctgcccttt cctcctgagc ggcgggccac agagtcctga acaacaggaa 28560 

gcccctgagg agggctccgc cctgagggag ggcaggggag cccccgccag ccccacccac 2 8620 

agcagcgggc cctgccaccc cccaccctga cacctcaccc cttggattcc agagaggaaa 28680 

gtgggcttgt gtgtagttta catgctcata tcttaaaatc accgttgtca atagaacaat 28740 

tcataataat gatgataaaa taagatttat aaccagcttc agtctggaga tacacacaga 2 8800 

gcagatcttc actcccagac agggagcccg cagctgcccc cgaccccaca ggtgcaggac 28860 

acacacagac agttcaacca tgtcttaaac acacaggtgt ttatttaatt gttcatttga 28920 

ttgaattttt aagttcactt tactacgtgg atgagatggg tgcatattac agtaggcttt 28980 

cgctatgagc gctgccacca tgaggaatat cccagccctc agttctgctt ccctttctga 29040 

gtcccacaaa agccagatgt ggacagcctt gggttcccat cccagctggc tgctccttct 29100 

ggggctgtct tggtggggag agggagatgg ggcagtgggt ccctgctgac ccctgagccc 29160 

tgcaggggtc aggatcctcc cgtggtccct gggtgtggct ctggaagaca ctggcagtgc 29220 

ccggccaagg cctcccgcag gatggaagtt gagggccctg gctctgggtc ctaagagaac 29280 

tcagccgccc ccttcacact ttacagcaag gggccaggca gcagctttgg gatggggctt 29340 

ccgtggagaa gtgggggatg ctgcagtggt acaaagacag cctcccccac cgccatcctc 29400 

cagctgaccg tcctccaagg ccagcactgg gcgtccaagg gaaagaagga actcagccca 29460 

gagggtgtgg gcaggagagg cctgg.agtca ggcctccacc cacagccccc tctgggtgcc 29520 

aagtgggaag ggtgttgggg ctggcttggg aaccttaccc gctgcccttc caacacctgg 29530 

atctgtgggc agcggtccca caaaatcccc cttggggctc cctgaggagg acttgtggct 29640 

gccgcttcca ccagggcag.a gggcacagga ggggccagca ctccaaaggg ctctagggtg 29700 

ggtctttcaa ggacatctgc aaagccctgg tggggagggg cctgggccag. aggctctttg 29760 

gaactcttgc acttctgagt gggggactgt ccatgctgcc cacaacctct agaccatgca 29820 

gcctgctcat gggtccctgg cagagaatgc ccactcccca gcagactcag ggcaggcccc 29880 

caactgcagg cttccaggaa ggcccagggt gtccacctca cgccaggtgg tctcagagga 2 9940 

cccctgtgca accacattaa ggaaagctgc agcccccacc cacccgcctg ccagttcaac 30000 

aagcaccggc tgcacacg.ca. ggctcccagg caccatcacc cccctccccc gtcgcccctc 30060 
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cctcacgggg agccccttcc ccctggaaag acagcaggca ctgtagcctc gcctgctggc 30120 

caggggcgcc ggctcagagg acctgccctg acctgcacgt gctgaccaga cagcccagcg 30180 

taaggacccg cgatcccacg ccaccgccct gggtttacca cggtcaccac cacctctctc 30240 

acagggcccc cgggggaccc agccgcgccc ggcctggtgt ctgcaccgag ggaccgcgtc 30300 

tcacgcccgg cggctcctgc aggggaagcc gtggtcagcg actcaccacg aggacagggc 30360 

agggcggctg agcgcggaag agaagcatga agctgggggc gggggcgggg gaggaggaac 30420 

aaaagttgca tctagacaga ggtgaacgaa acaaaaccaa aacccgaacg tgttccgtcg 30480 

caggatgggc gccgcccgtc ccgggccctt agcccgacat ctcttctcgc tgctccttgt 30540 

tcctgcgcac- ctcggccgcg tgcagctcct gcaggacagg gggcgggagg gcctgagggc 30600 

gggggcggct cggggcgact ccgggaaccc ccaggcgcgc aggccgtggc gccctggcac 30660 

ccgcccggcc tcatccgggc tggccttcgg caggaccctg actgagttga gggggcggga 30720 

gcaccgggga ggcgcagagc aaggccaggg accaaggacg ggtttcctgg gagctggctg 30780 

ggccccgctt ctagctcgta ccggagccga gcttccttca gggcactttc aatataatga 30840 

atttagccat ctattactgc ggctagttac tgtcccgcca ggaccagac.t ctggacctgc 30900 

ctcgtgcgct gctggggacg cccagtaaac acgggaggag cccccgaccc ccaccccagc 30960 

tcagcgcctc ggagtccccg gccccgctct gcgcccctcc gagctccgcc ctagccccgc 31020 

ccccgcccag tgccccgccc cctgcctgct gctagccctg cccccgcccc ggcccctgcc 31080 

cgctccgagc tccgccctgg ccccgccccg gcccctgccc gctccgagct ccgccctggc 31140 

cccgcccccc gcccagtgcc ccgccccctg cctgctgcta gccctgcccc cgccccggcc 31200 

cctgcccgct ccgagctccg ccccggcccC'gccccggccc ctgcccgctc cgagctccgc 31260 

cctggccccg- cccccgccca gtgccccgcc ccctgactgc tgctagccct gcccccgccc 31320 

cggcccctgc ccgctecgag ctccgccccg gccccgcccc ggcccctgcc cgctccgagc 31380 

tccgccccgg ccccgccccg gcccctgccc gctccgagct tcgccccggc cccgccccgg 31440 

cccctgcccg ctccgagctc cgccccggcc ccgcccccgc accttctcgc gcagccgctc 31500 

gcgcagtgcg gccaggtgtg cctcgcggat ctccttgctg agctccatct tgtagttgag 31560 

cttctcctcc gcctggcggc tgaagttgtt attctcctcc agcgccttgt gcagcacctc 31620 

gcgctcgtgc tcgcg.ccgct ccgccagctg cttcagcacc tgcgcctcct gcgtctgtgc 31680 

ggggccggcg ggcgcgcgtg agcggcaacc ccgggccctg cccggccgga ctcctccctg 31740 

ctctccgcct cccgcccagc gcccgctcgc ctcacctggc gcctccacct gcccaggcct 31800 

cggtgggcgc cgggaccccc gggcgctgcc ctgggaaccc tcgcctgcca tccggcctgt 31860 

ggtcggggca gggccagggg gtcgcgatcc gccgcccccg cccccgtccc tgcctcgcgc 31920 

gcgggtcccg cggtcctggc tgcgcccagg gcccccgcca taccctgccg ccactgcaca 31980 

ccctgccctg cgcgtctgcc cctccaagga ccagcagcaa gaaaccctaa acttgtgggc 32040 

ggtctctgag ctttgtctct tcctcggaca tccgcccact gagcagagta gctgcttgtt 32100 

acacaccggg ttcccagctc ccaattaggt gcccaggagc ggagggtccc cagggatgct 32160 

gggggagggg ccggctggtg acccctggga ggagagcggg gcagcaggac ccgcacccac 32220 

atgccagtcc ctactagtca gccctgtgaa ccctggtctc tggcctcacc gggaagggaa 32280 

cggagccgct tcccctgccc aatgcgttgg cctccagggt ggcaccccca aaaggacatt 32340 

tttatctctg tttcagtctc agaggggctg gtgggagggg aggctgcagg gaggggacct 32400 

ggagcccaca cccacctctc ccagggcccc tccgccctcc agcaagcctc agggtcttca 32460 

cacatgaggc ccttcctcca gcttccctgt ctgggagagg gatgccccac ccgacgtccc 32520 

cagggcccac ctggggacca ccccctagca tcctgctggc cctgacaagg gtgcctccca 32580 

ccctcaccag aggctcctgc tccttccagg tggccgcctc ggaacccttc ctcctctcca 32640 

tccctttctt tttttgttct tgtttgtttt ttgaaatgga gtctcaccct . gtcgcccggg 32700 

ctgaggagtg cagtggcgca gtctcggctc actgcatcct ccacttcttg ggttcaagca 32760 

attcccctgc ctcagactcc ctagtaggtg ggattacagg tgtgcaccac cacacctggc 32820 

taattttgta tttttagtac agatggggtt tcaccatgtt ggccaggctg atcttgaact 32880 

tccaacctca agtgatctgc ctgcctcagc ttcccaaagt tctgggatta caggcgtgag 32940 

ccaccacacc cggcctctcc ccatcccatt cttatccctc agaaagaggc ccagggagcc 33000 

acagcccctc ctgctccagg ccaaggcact gaccaagcct gtccgggagc accctgcttc 33060 

ttgcaggccc tgtccccgtg ggccgcctcc gttgaaactc ctggggggtg ggggatggag 33120 

gactccttgc cttcctccgc tcctcggctg cctccagccg cttttgcagc tcctccaggg 33180 

aggtgtcctt cttcttgggt ggggaggaga gcatagggct ctctggggac aggtcagaag 33240 

gggacttgag gatgacctcg aagctctggc ctgaggcccg cttgtccagc tgcttcacct 33300 

ccatgtctgc agggcaagac cagagtagag cttcagaggc ccggccaggg catggcgtgg 33360 

gctgagcggg atgctcccag cacacatcca accccagggc tgggcgagag ggggtggctg 33420 

ctcccgcagg aatcccaggc ttcagccccc aggatgggcc ccttccccct agaacccccc 33480 

tctccagagg cagccaggac gggagttcag agagactgcc ggaggccggg ggaaaaggtg 33540 

aggtgggcag gcaccgcagg gaagggcagg cggcagccag gcactcaccc ccgtactggt 33600 

agacggtatt ggggtgcggc tgtgtgtaga agcaggagca gatgagcgac agcaccgaca 33660 

gctccttcat cttctccttg taggctgtgg gcacaaggct gggctgagca agcaccactg 33720 
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gggcctgccc acctgggccc ccgttttccc tccccatggc tgcctctatc atgtctctgc ' 33780 

gagacacgga gctgcccagc acgctctcct gtgtgtctcc acaccgccgg ccccttcgtc 33840 

tctccagctc tctcgcttcc agacgtcggc actgtctccg tggtgtgtcc cctgccttct 33900 

gtctctctcg ccctctgcct ctccccgctt ttcctctctc tcggcattaa tgtctgtctc 33960 

atcttccaca ctgacttgtt tctccatcct tctcctgcct gctgcggtct gaatgtttcc 34020 

attacccaaa actcatgtgt tgaaatcgta accccaaggt gccggtgtgc ggaggtgagg 34080 

cattcggagg gaattaggcc atgaggatag agccctccta agtggcccca gagtggggct 34140 

tcagagaact ccctcacctt ccatcatgtg aggacacagc cagaagacgc cacccgtcta 34200 

tgtaccagga ggcgagacct ctccaggcac cgactctgcc ggcaccttga tcctggactt 34260 

tctggcctcc agagcgatgg gaaataagtt cctgtcgtct ataaaccact cagtctcagg 34320 

tacctgccca gactgacaaa gtggctaccc ctgcctgtct gggtctctgt ttaccttctg 34380 

tgtgtctgac tctgtcactg tcattgtatc tttctgtgtc tctgggggta gcccctgact 34440 

ctgtctttct ccctgagtgc atctttctgt gactccttgt cactgtgtgt ctttctgact 34500 

cttacctccc tctgtcccgc tacttctctc tcccctcctc ctccttccca ctcctcgcca 34560 

gctcaagcag gcaagattta ctcatgacgg gaccagcaca gatgcaaacc ctctgtgggc 34620 

aggactttct tgggctgtaa acctggatga agccctcaga ccctcctttt tccttcccaa 34680 

tgattgtgtg gtcaccttga gatgaaacca ggccctctcc aggcacatgc tctctgtcta 34740 

tctagggctg ggcttgggcc actgatgcca ccaaggagca agggagggaa gctgtccgtt 34800 

cagcaccaca gccagccctc- ttgcccattc aggtcaatca agtgcccacc agccagtgtc 34860 

cctgctgccc aacccaaacc agaagcaagc cgggctcccg tggccctgtg ccctgtcagg 34920 

ggaagaggaa ggcgcctgct gtcacagtga aaataattta gctcttttgg tctattcagg 34980 

gcgaacctca ttcctaagca gacacgctgg cccggtttct cactagtgct cgataatcct 35040 

tttggctggg tgcagtggct catttaactg taatcccagc, actttgggag gccaaggcag 35100 

gtggaacacc tgaggtcagg agtttgagac cagcctgacc aacatggtga aacccgatct 35160 

ctactaaaaa tataaaaatt agccaggcgt ggtggcaggc acctgtaatc ctagctactt 35220 

gggaggctga ggcaggagaa tcgcttgaac ctgggaggcg gaggttgcag tgagccgagg 35280 

tcgcgccatc gcactccagc ctgggtgaca gtgtgagact ccgtctcaaa acagaaagaa 35340 

aaagagagag aggaagaaag gaaggaggga gggagggagg aaaagaagaa aggaaaggaa 3 54 00 

aggaagacag acaaggcaga agtaatcaag cctttcatgg tgagctgggt cttctggtga 3 54 60 

cagtgcagag aatggtctgt cctgacttaa atttcctggt gacctacact tttctggaca 35520 

gagcagcaca gagcccaaga gggtgtaagg aggagcagaa aggaatccca gggtgggcag 35580 

gcccgtgcga gagcctttgg gggaaggaat gagactttga gccgggaagc gaggcaaagc 3 5640 

tacctgtctt ggtcattgtc ttcagggagg gagatggagg gggaccaggt gggggagcct 35700. 

cacaggggac tttggtctga cttgtcaagt tttctttttt tctttttgag atggagtctt 35760 

gcactgttgc ccaggctgca gtgcagtggt gcgatctcgg ctcaccgcaa gctccgcctc 35820 

ctgggtt'cac accattctcc tgcctcagcc tcccgagtag ctgggaccac aggcaccgcc 35880 

accacaccca gctaattttt tgtattttta gtagagacgg ggtttcacta tattagccag 35940 

gatagtctcg atctcctgac ctcgtgatcc gcccgcctcg acctcccaaa gtgctgggat 36000 . 

tacaggtgtg agccactgtg cctggcctac tttatttttt agaaacagga ctgtgctctg 36060 

ttgcccatgc tggagtgtag ggtgcagctg tgcggttcac tgcagccttg aacttctggg 36120 

cttgacggat cctgccatct tagcagctgg gactacaggt gcatgccagc acaccagttt 36180 

tctttttttt tttatctctg ctcactgcaa ttccgcctcc tgggttctag cgattctcct 36240 

gcctcagcct cccaagtagc agggattaca cgcacatgcc accacacccg gctaattttt 36300 

gtatttttag tagagacagg gtttcactat gttggtcagg ctggtcttga gccaccgcgc 36360 

ccgcccggcc tacacaccag cttaaaaaaa agaaaaaaat agctgggcgt "ggtggctcat 3 6420 

gcctgtaatc ccagcacttt gggaggctga ggcaggcaga tcacctgagg tcaggagttc 36480 

aagaccaacc tggccaacat ggcgaaaccc tgtctctact acaaatataa aaatcagcca 3 6540 

ggcgtggtgg cgggctcctc taattccagc tacttgggag gctgaggcag gagaatcact 36600 

tgaacccggg aggtggaggt tgaagtgagc caagatcgag ctactgcact ccagcctggg 3 6660 

agcaagactc ccgtctcaaa aaaaaaaaaa aaatttgtag tggtatggag gccgggcatg 3 6720. 

gtggctcacg cctgtaatcc cagaactttg aggggccaag gcgggcagat catgaggtca 36780 

ggagttcgag accagcctga ccaacatgat gaaaccctgt ctctactaaa aataacaaaa 3 6840 

attagccagg. catggtggcg ggcacgtgta gtcccagcta ctcgggagac tgagacggga 36900 

gaatcgcttg aacccaggag gcagaggttg cagtgagctg agatcacgcc actgcactcc 36960 

agcctgggtg acagagtgag actctgtctc aaaaacaaac acaaacaaac atatatatat 37020 

atacatgtat atatataata tatatatacg tatatataca cgtgtatata tataatatat 37080 

atacgtatat atacacgtgt atatataata tatatacgta tatatgtata tattaatata 37140 

tatacgtata tatacacgtg tatatattaa tatatatacg tatatataca cgtgtgtata 37200 

tattaatata tatacgtata tatgtgcgtg. tgtgtatata tatatgtata tatatatata 37260 

tatataca.ta. tatatataca gagagagaga gagtagtgat aggtcttgct gtcttgtcca 37320 

ggctgatctt gaactcccgg. cctcaagaga ccctcccacc tcagcctccc aaagcactag 37380 
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gattataggt 
tgatgaggga 
tggggacatt 
gagagagcaa 
aggtagggac 
ctccaggccc 
gccgggcgca 
cgcttgagaa 
aaaatataaa 
ctgaggcagg 
cactgcgttc 
gtgtttacca 
aagcctgaaa 
cagggaggct 
gggcaggtgt 
acctctcaca 
gggttctgca 
gcttggcttc. 
aggagcaagc 
tctggctaca 
aaetctggag 
ggcagggcat 
tcttattaca 
cagctggtgc 
tttttgtttt 
atctcggctc 
cgaatagctg 
agagacgggg 
ccgtctcagc 
ttttttttta 
ggctcactgc 
agctagaatt 
tgaggtttca 
cctcagcctc 
tgctgggatt 
ttctcaccgc 
ttctcgcagt 
accgaggaaa 
aggttggcct 
acccctcccg 
ctcccgccgt 
cgtcgcccag 
cacccgggtg 
gcccgctccc 
cgctcccagg 
cccagggtca 
gggtcacacc 
ttctccttct 
tagaaagagt 
tgcaacctcc 
atcacaggag 
tcaccatgtt 
ctctcaaagt 
ggaatgaagg 
agcccagtca 
gatctgtcca 
ggaggctgct 
tcatggtgcc 
ggaaactcag, 
cttagcaggc 
ggagaggtgc 



gtaagccaca 
gttagagggt 
tagaggagct 
gagggagttt 
ttggcccctc 
ctacccttca 
gtggctcacg 
caagagtttg 
aattagctgg 
agaatcactt 
cagcctgggc 
taaaggccag 
ttt.tcaccga 
ctctcccagt 
gagcacctgc 
ggtcttggct 
gctgcagaca 
tcccagctgg 
gggaccaggt 
gggaggaagg 
agagcaagca 
tggaccagta 
gagagaggat 
cagggaagcc 
tttggagacg 
acggcaagcc 
ggactacagg 
tttcactgcg 
ctcccaaagt 
agacggggtc 
aacctctgcc 
acaggcacat 
ccacgttggc 
ccaaagtgct 
acaggcgtga 
cccctcccca 
gagctgggct 
■ tgcggattgg 
caagcagcag 
ccgtccaggt 
ccaggtggat 
gtggatgg.tg 
gatggtgccc 
agggtcacac 
gtcacacctc 
cacctcacgc 
ccacacccac 
ctttcgtctt 
ttctactctt 
ctctccccgg 
cgtgtcacca 
ggccaggcta 
gctgggatta 
ctcagagagg 
tgagcctggg 
gaactgagag 
gccggctccc 
aggagcacag 
aaatgtggcc 
aagcggccaa 
cagcaccgtc 



gtacccagcc 
gtgccagcca 
gaagaggtgg 
gggctggggc 
caggcctgac 
gctcatcctt 
cctgtaatcc 
agaccagcct 
gtgtggtggc 
gagcctggaa 
aacacagcga 
aaaatgtgat 
aaaaaagatc 
ttgcctcacc 
atcatcagga 
ctgcccagga 
caggcgcggc 
tggcccacga 
acccacagga 
gcagattgga 
aggggaagtc 
ttttcagagc 
gccgaggccc 
cctcccagag 
gagtctcact 
ccgcctcccg 
cgcccgccac 
ttagccagga 
gctgggatta 
tcactctgtc 
tccccagttc 
gccaccatgc 
caggctggtc 
gggatgacag 
gcccccgcgc 
ccatccttcc 
tgttttgtct 
cctttggatg 
ggtatagtag. 
ggatggtgcc 
ggtgcccact 
cccactccca 
gctcccaggg 
ctcacgccca 
acgcccaccc 
ccacccctcc 
ccctcccgcc 
cttcgtcttc 
gctgcccagg 
gtcaagcaat 
cacctggcta 
gtctcgaact 
caggcatgag 
gcaagcattc 
cctgactgca 
gccaggggac 
cccgggggca 
catggcagga 
accacagtca 
gcacctttcc 
atctctaccc 



tattaaaaat 
tgtgttccac 
ccaaccctgt 
agaacgtacc 
tctgccagca 
ccttatcaca 
cagaactttg 
aggcaacatg 
acatgcctgt 
ggcggaagtt 
gactctgtct 
taacagctgc 
tgtaggctca 
tcagcaagca 
aagaaggggc 
gacacgtgtc 
tcagcatctg 
ggagcttcta 
cggagcctgg 
gggggcagga 
tgcgcacagg. 
tggtgaggct 
agacccatcc 
cctcaaggca 
ctgtctccca 
gattcacgcc 
cacgcccagc 
tggtctcgat 
caggtgtgag 
acccaggctg 
aagtgattct 
ccagctaatt 
ttgaactcct 
gcgtgagccc 
ccggcccctc 
ccttcactga 
ccctgcttct 
attcatgagc 
tgcccgctcc 
cactcccagg 
cccagggtca 
gggtcacacc 
tcacacctga 
cccctcccgc 
ctcccgccgt 
cgccgcccag 
cacccgggtg 
ctcctcttct 
ctggagtgca, 
tatcctgcct 
attttgtact 
cttgacctca 
ccaccctatc 
cacccagcat 
ggctcctgtt 
cacagtggcc 
gatggaggtc 
gccaccagcc 
ggctgcttga 
tgaaatattg 
agataaggag 



taatgttaaa 
agcagcaggt 
gctcaggagg 
tgggtcctga 
accagctccc 
catccaaaac 
ggaggctgag 
gtgaaacccc 
tgccccagct 
gtagtgagca 
caaaaaacaa 
tcaaagcccc 
tacagaggaa 
gacggctggc 
acggtgggga 
caactgagag 
atggccacgg 
tcccaagaga 
gagtgaggcc 
cagcaggtca 
gcagggctgg 
taaagagcat 
aggccacctc 
ttgctccctc 
ggctggagtg 
attctcctgc 
taattttttg 
ctcctgacct 
ccaccgcgcc 
gagtgcagtg 
cctgcctcag 
ttctgtattt 
gacctccggt 
ccgcgcctgg 
cctctctttg 
cttcagggag 
ctttgtacta 
aggagtcaga 
cagggtcaca 
gtcacacctc 
cacctcacgc 
tcacgcccgc 
cgcccacccg 
ccgcccgggt 
ccaggtggat 
gtggatggtg 
gatgccctta 
ttcttctttt 
, atggcacaat 
cagtctccca 
tttagcagag 
gtttatccac 
tgcctcactt 
cacacagagt 
.gggagctcgc 
tctgacccct 
actgtcaccc 
gatttgcctt 
cgtgcggtga 
aggcctcaga 
acccaggtcc 



caagaggatg 

caggagacat 

acgggggagg 

gaggataaga 

tatcagcaga 

tctgaatgtg 

gcaggaggat 

atctctacta 

actcaggagg 

gagattgtgc 

aaactggaat 

tgtctgccct 

ggacaaacac 

agcaatttgg. 

cgcaggtcag 

gtgaggaact 

agaccccctg 

ctgtccctca 

tgccctgtgg 

ggaattggcc 

tcaggggcga 

gtctactgcc 

tccacagaca 

tctctctctc 

cagtggtaca 

ctcagcctcc 

tatttttagt 

tgtgatccgc 

tggacttttt 

gcgcgatgtc 

cctcccaagt 

ttagtagaga 

gatctgccca 

ccccccgcag 

actcccttct 

ttaaaaacaa 

aacattagat 

aaaaggcacc 

cctcacgccc 

acgcccaccc 

ccacccctcc 

ccctcccacc 

ggtggatggt 

ggatggtgcc 

ggtgcccact 

cccactccca 

tcagctctcc 

tttttttttt 

ctcagctcac 

gattgctggg 

aggggggatt 

cggcctcagc 

ctacagagga 

gccgggtgag 

ggaggtgggg 

ggagggccct 

aggctgcttc 

tccctgggca 

gcactcatct 

acaag.cccca 

tgagaggtta 



37440 

37500 

37560 

37620 

37680 

37740 

37800 

37860 

37920 

37980 

38040 

38100 

38160 

38220 

38280 

38340 

38400 

38460 

38520 

38580 

38640 

38700 

38760 

38820 

38880 

38940 

39000 

39060 

39120 

39180 

39240 

39300 

39360 

39420 

39480 

39540 

396,00 

39660 

39720 

39780 

39340 

39900 

39960 

40020 

40080 

40140 

40200 

40260 

40320 

40380 

40440 

40500 

40560 

40620 

40680- 

40740 

40800 

40860 

40920 

40980 

41040 
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ggcagctcgg 
tgtgagcaga 
ctcccagctc 
ggaagggcag 
ccctccgctg 
ggaggaagcg 
cacggtcccc 
ggggcggggt 
cccgagttga 
aaaaggtggg 
ttcgtcctgt 
gagcagagcg 
tcccctgcgt 
cgcgcgggac 
cgcgggaggt 
gcagtcgccc 
agcaggtggc 
gcgcggtccc 
cctccagccc 
agagggtccc 
cgccggacgc 
cacgcccggg 
caggggcgag 
gcagcccagc 
cccaggcttg 
cccaggcgag 
ccggagccta 
tggagaggcg 
gaggggagag 
agggagggag 
gggagggttg 
agggaggaag 
ggctctcagg_ 
cctccaaccg 
tggggcgggg 
gaggagcagg 
aggctgggct 
ccacaggggt 
tgggcaggat 
gccgtgaaag 
caagggcaag 
tgcccccaga 
aacctatcct 
gagcgtcttt 
acaagcaact 
gccatgaggc 
tgggagcacc 
gcaagccggc 
attgtgtggc 
cacaggagcc 
caaagaaggt 
cccagccaac 
aagaagagaa 
caactgccca 
tgttttaagc 
acagaggtgt 
gcagagaggc 
gtagcagttg 
aggtgctagg 
tggggtagga 
ctggccaggc 



acaacaccac 
ggccacaaaa 
agccccagga 
gaggcggccc 
agagcccccc 
cagatggaca 
tcctggcagg 
caggtcgggc 
accagccagc 
agggagtgcg 
ccccccaaac 
ttucgggaag 
ccagaacccg 
gggccgggag 
ggaggggcgg 
ctcccccatc 
gcgggcgtcg 
cctcactccg 
ctgtgctgca 
accctctccc 
cccagggacc 
tctgggagcc 
gcgcccccgc 
gccccaccag 
caacgcgcag 
gccagacgag 
ccggaaatgg 
caagtggcgg 
gagtggaggg 
acagagggag 
ggggaaggag 
ggagggagga 
gggagagggc 
ctcacagcgg 
gtggagatga 
tcccacccat 
ggcgccaaca 
gggcgctggc 
ctgagcggag 
aggtggtttc 
cgtctgacct 
cgatggagag 
gcctccccct 
gccattgcag 
aagccacagg 
. tggctcagca 
caaggccagt 
atcatccccg 
ccagaggtac 
ctttaaaagc 
tttctgttgt 
agccagcaag 
tgggcttgga 
cttggtttca 
tgttggtttt 
tggcccctgg 
acacagagct 
gggatggtca 
gatcaagaga 
cccacccagt 
cctctctgag 



acagctggag 
gagcgaggag 
accgagccca 
cggagaggac 
cacccccagt 
ggactcccag 
tgctgaagtc 
acgcgtgggt 
tcgggagagg 
gctcgtgaac 
gtcgagtgaa 
ggcgggccca 
ccccccgccc 
gccggggtgg 
gaggggcgga 
cagaccccgc 
caaagggtgg 
ggctccgccg 
ctggcgcggg 
cgcctcccca 
ccgcccgcac 
accctgcggc 
ccgacgtccc 
ccccgccggc 
ggtaggagaa 
gcctccgctc 
tgctggccat 
ccggagctgc 
agcgagggcg 
agagagagag 
agagagagag 
agagagagag 
ggaggagggg 
gacagctgct 
gcccccattc 
ggggtggggc 
gggtccaggt 
ctggaggcct 
aagagtgaaa 
ccgcctgggc 
cgctgggcct 
gaggggcgcc 
ctcctcataa 
ggatgaagcc 
gccaggagca 
acacaaaagc 
cacatcctag 
ggactgtgag 
ccctaagact 
agtttccttt 
tgagatgagg 
acaacaagac 
ggcaggtttg 
gccttgggtt 
gtggtgattt 
gccattccca 
gcccaccaca 
gcatccttca 
agtagccctc 
caggctcaga 
cctctgtgtc 



gaggtcagac 
ccagtgccca 
tggggaggga 
ggcgctgccc 
atccccgggg 
atagggtggg 
acctggagc.c 
gggcggagtt 
gaccgcggcg 
gggggcggcg 
aaacgaagcg 
gcgtcctcgc 
gggcctgcgc 
gcgctacctt 
gccctctggt 
ggcgcaaagg 
cccccgaggc 
tgtctggccc - 
gagcgccggg 
cgaaggctct 
atcgcgagcg 
agtcgcgccc 
ggtcccgagc 
gccgcagacc 
gggaaattgg 
agatcccgcc 
ggtgctggcg 
agacggctgg 
ggcgggaggc 
ggttggggga 
agagagactg 
gagcaagcgc 
ggctacccga 
tctccaactt 
cccagggcag 
cgcagagccc 
ccgtgcctgc 
ccaaggggca 
gtggaggagt 
tcagaccttc 
ttgtttctca 
agggttgtgc 
ctcccaaagg 
accgaggcag 
aaaccctgaa 
caggggcaag 
ggtggcctgg 
tctggtgggg 
ggaggatcac 
gcctggttga 
gggccacgtg 
cttaactgca 
accccagggc 
actaagaatt: 
gctatgaagc 
ggaagccagc 
gcagctgtgt 
ggcagactcc 
tctgggacct 
gcaccgcaat 
ctcatctgga 



tccgggttgc 
gatgccgaga 
ccgtcaggga 
tcaggggcag 
gtgtccagga 
gaggtgtggc 
tccaagcccg 
ctgcgccccg 
acctgtcccg 
atgggaagga 
ggttctgcgg 
gcccgaggtc 
ccgcccctcc 
cgaaggcggt 
ctccggaggg 
gcagtggctt 
cgcagcggt'g 
-gcccccctcc 
ttcccggctg 
ggcggaccca 
cgcccacccg 
tgcgtggcac 
gctccccggc 
ccagcctcgg 
cgtccgctgc 
gccccacaaa 
gcggttgggc 
tgctgcagtg 
gggcgcggcg 
aggagcgggg 
cgggggcggg 
ctggctgcgg 
actgcaacaa 
ggctttgtga 
gcggggcagg 
ttcgccgcca 
catcggagag 
tctcctgtga 
gaggccagaa 
actcactgtg 
ggggtaagat 
gcacttagtg 
gaaag.cctgg 
ggagaaaagt 
aacctcagga 
cctcagctct 
agagtcctga 
gtgatgccca 
caggcgggcc 
agaaatcgga 
gcaaggatct 
aggaagtgag 
ctccacacaa 
aggaggtaat 
catatcaaac 
tctgcgaagg 
cctccctg.tt 
agccccgggt 
ccagagtctt 
gcctcacact 
gcacagggac 



agaaggagaa 
tgccctcgcc 
aaggctgcca 
gaggggagtc 
ggaggcggag 
cggtgacaca 

tggggcctga 
ggccaaggcg 

ggggcgtaag 

ggtgcggccc 

cctcgcggcg 

gcccggcagc 

gcctg.agctc 

gggtccgccc 

tttggggatc 

ttctggccag 

tggggggagg 

ttcagcgccc 

gggctttggc 

gatctcgggt 

gtcgcgagcc 

gccgctcccc 

gcggcgcctc 

gcgggtcggg 

cggccgctgc 

gcccgtggcc 

ctgcggaggc 

ccggggaggg 

ggagagagag 

ggaggaggga 

ggaaggaggg 

aaggggccgc 

gaccccccac 

ggcctgagag 

ggcaatgccg 

aggccgctgt 

gacgccacag 

gcccagggga 

caaaggctct 

tggcccaggc 

gaaacaatga 

agtggggggc 

taggcaaacg 

gctttgccct 

gacttgcaga 

agcagtgcgg 

cccctgacgt 

ggaatgtgac 

ctgacctcat 

gggatcaaac 

gagaactgct 

ttctgccaac 

gaactgagcc 

gaatgag.agt 

taatatacac 

aggaagaagg 

ggccaccaca 

gc.tggagctc 

ctcatgtggg 

cattgtgact 

caggtgtgtg 



4110O 
41160 
41220 
41280 
41340 
41400 
41460 
41520 
41580 
41640 
41700 
41760 
41820 
41880 
41940 
42000 
42060 
42120 
42180 
42240 
42300 
42360 
.42420 
42480 
42540 
42600 
42660 
42720 
42780 
42840 
42900 
42960 
43020 
43080 
43140 
43200 
43260 
43320 
43380 
43440 
43500 
43560 
43620 
43680 
43740 
43800 
43860 
43920 
43980 
44040 
44100 
44160 
44220 
44280 
44340 
44400 
44460 
44520 
4.4580 
44640 
44700 
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gaagcccgcg 
acacaacagg 
tcagccatac 
ctagcaggca 
gagtaatgac 
gtgccaggtc 
gcatttcaaa 
gtgcagtggc 
gaggtcagga 
acaaaaatta 
gcaggagaat 
cactccagcc 
agtgtgactt 
tggcccacac 
gagttcaaga 
tagccgggcg 
atcgcttgaa 
cctgggcgac 
ggctcctaga 
aagcacctgc 
tacaggagtt 
tgactcagac 
gagatgtgtg 
caacgctctg 
ctgggcgcgg 
acctgaggtc 
aaatataaaa 
ctgaggcagg 
cactgcactc 
cagaaggtga 
gagagagaag 
aacagcctgg 
cacgtgggga 
gcatgtatgg 
gctgg.tctcc 
accacaacag 
tcacacccag 
cagccatgaa 
aactccctcc 
aaacatccgt 
ccctgctgcc 
ggacagtgag 
tctcaggagc 
gctcacactc 
-aatggtggtg 
catcaaggga 
tcactggcca 
gaagagcagg 
accggaagtg 
aaaactctga 
gttcctgagg 
agtgacccta 
aactacagct 
cgctgcattt 
gagcccttga 
cagtcccagt 
cacgggttgg 
ggcggatggg 
gaggt.tcaga 
ctcttcagcc 
ctgtgggcgt 



gcatagtgcc 
ggtactgact 
tgactgtcag 
catgccatag 
agcatgctgg 
ttggagcaga 
gatgtttt.tc 
tcacaaccgt 
gtttgagacc 
gccaggtgtg 
cacttgaatc 
tgggcaacaa 
ttatcagacc 
ctgtaatctt 
ccagcctgac 
tggtggcatg 
cccaggaggt 
agagcaagac 
aggagcccta 
tgtatgctgg 
cctgaccacc 
ttcagctcag 
gctgcaaaag 
tgcctcactc 
tggctcacgc 
aggagttcaa 
aactagccgg 
agaatcactt 
cagcctgggc 
agtctatgtt 
gggaaactgc 
aggaaactga 
ttacaattcg 
ggggcactga 
tcatcttcaa 
gcacatgggg 
gagcgtgggc 
gaagccacga 
tggggccttc 
gacgctgggc 
agccagcgcc 
ccgagcagaa 
cgggtcctgg 
gagcagtaaa 
gctcggccta 
ggagacgggc 
gcggagccgg 
cggaccccct 
gggggcggaa 
gctggctgac 
gaccccggtt 
caagtggttc 
cccggcaggc 
tgggacctgt 
ccggggctgg 
ccccgggcgg 
catcccggca 
gaatggactc 
ccctccgctc 
ctcacgctct 
cttttccaga 



aggaacacag 
gtcagcaccc 
cgtgctagca 
gtgtactgac 
caggcacaca 
ttttgacttc 
acatgcacat 
aatcccaaca 
agcctggcca 
gtggcaggtg 
caggaggcag 
tatcaagact 
tctgcattct 
aacactctgg 
caacatgatg 
cacctgtaat 
ggaggttgca 
tctgtctcaa 
tatctcagca 
agctgtgggt 
ccaggccagc 
tccacaggac 
gtaaactcct 
tctcacccag 
ctgtaatccc 
gacctgcctg 
gcttagtggt 
gaacccagga 
aaaaagccaa 
agtcccagtt 
cacttttaaa 
ccgcatgatc 
aggtgagact 
aacttgtgct 
ctccctgccg 
ctgggtgcca 
acgcatrggca 
agaagtgctg 
tggggccttc 
tgaccccaca 
gggtgctggg 
gccggaacgc 
gcaaggcgca 
aagacgcctc 
gagaggcggt 
gggaccccag 
acacagaacg 
ccgcggggga 
gtgcagtggg 
agctggggac 
ctggaaggct 
ttttacccga 
cccaagccgc 
agttccctgc 
agggaagggc 
gattgaggct 
tgctctttcg 
cagccagcca 
cgacgttgcg 
tgtggaagtc 
gaaggacgga 



tagatgtgca 
aggcaggcac 
ggcatacaca 
tctcagtgca 
atagctgtac 
tcaccaagat 
tagtgctagt 
ctctgggggg 
acacggtgaa. 
cctgtaatct 
aggttgcagt 
ccacctcaaa 
tgaaattaac 
gaggctgagg 
aaaccccatc 
cccagctact 
gggagccgag 
aaaaaaaaaa 
ggacactcag 
acgtcagcaa 
acgctcctat 
agccttttct 
ggctcctgag 
aaagtgtgga 
agcactttgg 
gccaacacgg 
gcacacctgt 
ggtggaggtc. 
aactctgtct 
cccaggtcgt 
ccatcgggtc 
caaccacctc 
tgggtggaga 
tggtgcccac 
agtctcagat 
gtgagtgctzg 
ttcgcacg.tg 
cctgccggcc 
tggggcatgg 
gaacggtgcg 
gactcaggga 
aggagaggaa 
gccgttttca 
tggggaggag 
agtggaacg.c 
atttctgcct 
cgcaaaacgc 
acagtttccg 
ctcagcgccg 
gggtggcggc 
cgccgcggag 
gcggctcgta 
cctcggggcc 
gctcgtggcg 
cgacactcag 
tagagaagtt 
ctactttggc 
ggagggcaga 
cctggtgagg 
gcggaattac 
gttgtggggc 



cagtgtgcac 
acgcaatggg 
acagctgtac 
ctggcaggca 
tgactgtttg 
caaacgcaga 
taaaaaaatg 
ccgaggtggg 
accccatcta 
cagctacttc 
gagccgagat 
aaaaaaaatg 
tctggcttgg 
tgggcagacc 
tctactaaaa 
caggaggctg 
atcgcaccac 
gaaagaaaga 
tcattcaaca 
ttagaggaag 
agcagctggc 
ggccactgct 
caggctctgg 
caatgagagg 
gaggccgagg 
ccaaactcca 
aatcccagct 
gcagtgagcc 
caaagaaaaa 
acatggcggc 
tcctgagcac 
cctccaggtc 
cacagagccg 
tcattcaacg 
aggcagcctg 
gggcttcccc 
ccgtcagtgg 
gtgcgcggtc 
gagctggggc 
ggcctcgctc 
ggcccgccgg 
ggagaggggg 
aattttcagg 
gcccgtgcag 
agaccctggt 
gtgggcgatg 
cgtgtaggcc 
ccgggagcac 
actgcgcgcc 
cctcgactgg 
acaagtgagc 
ggcgcgttgc 
gcgggtcggc 
ctggcgccgc 
cgtgtccgcg 
gagtgatttg 
tgcatctggt 
gggctggaga 
ccgggagggg 
tgcaggcgga 
gggaggataa 



tagcaggaac 
gcaccgactg 
tgacagcaca 
cacgcaatag 
ccccaatata 
aagtgcacga 
ttttgactgg 
cagatcacct 
ccctaaaaat 
ggaggctgaa 
cccaccactg 
tttttcataa 
ctgggcgcgg 
acgaggtcag 
atacaaaaat 
aggcaggaga 
tctattccag 
aattaactct 
gacatctgtc 
agggcagggg 
aaggagcaga 
ctcaggagat 
gcaatctgct 
acttatctgg 
cgggtggatc 
cctctaetaa 
acttgagagg 
aagattgtgc 
agaatcatgg 
aggagaaaga 
tcactgtcag 
cctccctcca 
aaccacatca 
agtgtgtgtg 
cagttccttc 
gagcactatc 
acattaaaca 
acgcagcgcc 
cgcctgagac 
Ctggagtcag 
gaccactgcg 
cggtcagggc 
aaagcggtcg 
ctctccgggc 
gggggaatga 
gaagtgaggt 
tggaggagcc 
aaagcaacgg 
cctgcccgcg 
agtcggttga 
agtgagtcgc 
ggtttttcga 
ggattggccg 
ggcgttggct 
tctgttctgt 
ctgagggctg 
tgcccacccg 
ggcagggccg 
tgccgcttgc 
acttgcagca 
ggcaaggccc 



44760 

44820 

44880 

44940 

45000 

45060 

4512.0 

45130 

45240 

45300 

45360 

45420 

45480 

45540 

45600 

45660 

45720 

45780 

45840 

45900 

45960 

46020 

46080 

46140 

46200 

46260 

46320 

46380 

46440 

46500 

46560 

46620 

46680 

46740 

46800 

46860 

46920 

46980 

47040 

47100 

47160 

47220 

47280 

47340 

47400 

47460 

47520 

47580 

47640 

47700 

47760 

47820 

47880 

47940 

48000 

48060 

48120 

48180 

48240 

48300 

48360 
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agccacttcg catcttcgcc ccgccagctc ct-cgagatgg gatataccag ggttgctctc 48420 

caaccctctc cgcaggaggg actgatggaa acgcctggga aagtagcccg gtacccacaa 48480 

aggcngtcta caaacagagn cctaccgtct ttcccaggtc cgtg.ccatag ggattctcga 48540 

agagaacagc gctgtgtccc agtgcacatg ctcgcatcgc tnaccaggag Cgcccgagac 48600 

cctaagatgt tcggagtggt tntttcgcac agacccgaat agcctgcccc tcagccacgc 48660 

tctgtgccct tctgagaaca ggctgatatg cccaagatag tcctgaatgg tgtgaccgta 48720 

gacttccctt tccagcccta caaatgccaa caggagtaca tgaccaaggt cctggaatgc 48780 

ctgcagcagg tagagcacag gccccgagga aaggactgcg ggtgggtgga gcttcagcca 48840 

ggacggggtg tgcttccctc tcccggccca ttccagccag gcccctccgg gccagaggca 48900 

gcgtctgtca taaaaagggc tggtgttcca ggtggggtca gagagaggat tgacaagtaa 48960 

aaacgatcgt cctttgaagg gggccggccc ctccacacct gtgggtattc ctcatcaggc 49020 

gggacgagag accgagaaaa tgaataagac acagagacaa agtatzagaga gaaaagtggg 49080 

cccaggggac cggcgctcag catacagagg acctgcaccg gcaccagtct ctgagtttcc. 49140 

ccagtattca ttaattacta ttttcactat ctcagcaaga ggaatgcggc aggacagcaa 49200 

ggtgatagtg gggagaaggt cagcaagaaa acg.cgagcaa aggaatctgg gtcacaaata 49260 

agttcaaggg aaggtaccat gcctggatgt gcacgcaggc tagttttatg cttttctcca 49320 

cccaaacatc tcggtggagt aaagagtaac agagcagcat tgctgccaat atgtctcgcc 49380 

tcctgccaca ggg.cggcttt tctcctatct cagaattgaa caaatgtaca atcgggtttt 49440 

ataccgaaac attcagttcc caggggcagg caggagacag tggccttcct ctatctcgac 49500 

tgcaagaggc tttcctcttt tactaatcct cagcacagac ccttcacggg tgttgggc'tg 49560 

ggggactgtzc aggtccttcc catcccacga ggccatattt cagactatca catggagaga 49620 

aaccttgggc aatacccggc tttccagggc agaggcccct gcggctttcc gcagtgcatc 49680 

gtgcccctgg tttatcgaga ccggagaatg gcgatgactt ctaccaagca tactgcccgt 49740 

aaacatattg ttaacaaggc atgttctgca cagctctaga tcccctaaac cttgattcca 49800 

tacaacacac gtttctgtga gctcaaggct ggggcaaagt tacagattaa cagcatctta 49860 

gggcaaagca attgttcagg gtacaggtca aaatggagtg tgttatgtct tccctttcta 49920 

catagacaca gtaacagtct gatctctctt ttccctacag tccttgaggg tgacagactt -49980 

aggagtgcct tgggggcctc tctgaggagc agctgatatc cacgggtcag gaggaagcat 50040 

ttccattaga ggggcagccg gtggccagcc tcacttggaa ggtctttgaa cctcgggggt 50100 

gcagggaggc ggcagtggtg caggtcgcct tcccccgggt tccttgaggt gccctcttgt 50160 

acccggctca cacccccccc ctccccgagt ctcctgctca ggttcccgtc Cgagagcttg 50220 

catgtaggac gtcagatagg acagcataaa tgtttggatc cagaaacgca gaacagttcc 50280 

ctattttgag acttgacacc taattagtca tcttactatt taagctgaaa aatagtgtcg 50340 

tgttttgggt aacgttctgc aaaCcgtttg ctaatggcgg etgagttgct tcacgccctt 50400 

tagggcaaga gtgggacttg cctgtggact tctccgcggt cccacagggc tctcgccacc 50460 

tggcagtggc ctctgcatct gcaaagagcc gcccgctggc cgccgaagct tgtcccaggg 50520' 

cagcttgtgt ggcctcgcct cttcctggct tccccgtaac cetcgctccg aactccgttc 50580 

agaaggtgaa tggcatcctg gagagcccta cggg.tacagg. gaagacgctg tgcccgctgt 50640 

gcaccacgct ggcctggcga gaacacctcc gagacggcat ctctgcccgc aagattgccg 50700 

agagggcgca aggagagctt ttcccggatc gggccttgtc atcctggggc aacgctgctg 50760 

ctgctgctgg agaccccaca ggtgacccta gttcccaggc ctctcctggc ctcctgtggg 50820 

gatggttggc aagggatggc gctgagggtg gggtgggccc atggggactc ctgccgtctc 50880 

tcaagcagaa ctcaaggaga attttttagc tgctgtataa tttctcgcca tcgtgggtgt 50940 

aaacctaggg ttgggctttt ttgctgaatt agggcacggc agatgcccac ttcacccatt 51000 

. ttfgataaac cagtatctgg ggtgtcagat tcttggctgt ctgcagggcc gagttagccg 51060 

aatgccacct gcctttgata cgtgagaacg ttgtctgaga accgtgactt ctgtgcttgc 51120 

ttgtgtctgg tcagcttgct acacggacat cccaaagatt atttacgcct ccaggaccca "51180 

ctcgcaactc acacaggtca tcaacgagct tcggaacacc tcctaccggt gggtcagacg 51240 

agtttacacc tgtctcgggg tcctcaagag aaccagcttg gcatggtgct gagtccacag 51300 

ccccatgctg tgctgtggtg gagggtggtg gtctttctag acgctccccc gaagtgtgca 51360 

gagcgctggt gcccaggggt ggggtgcggc ctgggctgcc tccaatgccc attacctgtg 51420 

aggaagcagc tttgcatctg tgtgctgacc ttgggcgggc gtcctgagct cctcgcaggt 51480 

gctgttgtag cagctgtgca gtaggtcagg gctggccccc agtgcagctt tgcacatgaa 51540 

gcaggaggag gccctgctgc ttgtcagagc ccagcagagt cttggtgttc tgtcgggttc 51600 

ctgtggccgg accagtggca gggtgctg.tg gaagctgtcg aatctcctcc ctctgtccag .51660 

tacccccgct cgtcttctag ctccctccta cgcccgggcc acgtttcagt tatgctcact 51720 

tcctctgacc gccgaggctc ctgcgtgtct ccatacagct cacgctgcag ggccacgctg 51780 

tgggtgttgg agacagctcc tcctcgaccc acggtgctct ctcccaccag gcctaaggtg 51840 

tgtgtgctgg gctcccggga gcagctgcgc atccatcctg aggcgaagaa acaagagagt 51900 

aaccatctac aggtaggctc ctgggctccc gctccggctc agtgtccgac aggcgagtgc 51960 

tgctgggtgt ccagagcccc aggc.tgcg.ct cccgctgggc tagggcttga agttcactgg 52020 
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gggactgcag gggaggacct ggtgggggtg gggactggct tcggtccttt cttggccgtg 52080 

cttcagctgc gcactctgcc cttcctccca cagatccact tgtgccgtaa gaaggtggca 52140 

agtcgctcct gtcatttcta caacaacgta gaaggtacaa gcagctgggt gggaccaggg 52200 

tcgggttgga gtgtgtgcag cctctcaggg tggagctcag tggtgtcaca gcctggttgt 52260 

gcttgcccgg tggggcggcc agtgcggcca tgtacctggg ccctgtcttc cgactcgggg 52320 

ccacccatgt tagacttctg tgtggaagag ctcacacagt ggtctgagac agccagccgg 52380 

caagactgcc tctggctggt gcccggggcc ttggattttg ggaaggctcc ctccacttcc 52440 

tgatgagagg gtctccctgc acctaacctg ctggtgcaaa cagtaggggt tttgctgaac 52500 

accggctttc tcttcgggga ctttgttgct tgcccagcag caggtgctcc agtgaccggc 52560 

cctcacacca tcttgggagg gtgtcctgga agccgtgccc ggcctcccgc gaccccgccc 5262 0 

cgtgtgtctt tttcctgtgc tgaccttgct gcggaaaatt atggccctga gtgtgactcc 52680 

aggctgagtc ctgtgggtcc aacacgggaf gccttggggc ctcttctgga gacgggatg.t 52740 

gagtgacagg agccggccgg ggcagcttgc cctgtgactg cacgtggcca cagcctgtga 52800 

gggccggggg tgcttctcca cccacgtggc tgcccctcgg gtatgtcaag ggcttctggg 52860 

gctcatcacg gggtcctaga gacagtggca gggtgcaccc ccgttggctg cccttacagt 52920 

ttctgtgacc tgagggtggc atctgtgcag tcggcgcggt ctgtgcttct gtgggatcag 52980 

ggttccctct gtttcctgcc tcagttgggg ctcaagcctc aggtgaggtg gccccggagc 53040 

actcagaagg catcggcggt cctgtgggct gctttctgca ctcacgtttg ctgagtgctc 53100 

agtgtgccag gactgaggac cctgaagctg ctcttgtatt tagggcggcg ctcccctggc 53160 

agagactgag ccaggtggtc ccgcatgacc cactaccagg cgtttctggg ccctggccct 53220 

tggagggaca gggtgggcgg aacatgggcc tgcagggagg ctcccgctta ctggaggcat 53280 

gtgctgtgtt gctggagaca tcctctgtgt tgcttcttgt tcgctgtggt ttttggtctg 53340 

gtggcaccaa ggaccctcag tcatcttgat gtgtggttgt ccaggccttt ttgttggtcc 53400 

taagaagggg ctctgccttt gtgcccccag gttccctgac aggagctgcc ggctcgtccc 53460 

ggtgatgcct gcaggacgtg actctgggac ggggggttgg gcagatgtgc tgatggaaat 53 520 

tctcaagcag gcgtcatttc cgaggtcctc acctggattt ccaggacagg agtgcctgct 53580 

'gggtgtcccc agtcccatgc agcgggggtc cttgggatag catggaacgc tgagcatggg 53 640 

cctggccggc cgtggtcctg gacaagggca gtgccccggt ggctgctggg cctgggacct 53700 

ggtggggacg ctgggcctgg tacctggtgg ggatgctggg cctgggacct ggtggggagg 53760 

cctctgactg cctcctggtg ctgcttccgt ctgtgttagg cctctgggta ttggggcccc 53820 

catctgtctc ctcctccagg cctgtggact cagaccagga agacacaggc cagcccctgc 53 880 

ctgtccccct tggcttgggc tctcactgcc cgacctggcg ggaggttgcc tagccgtgaa 53940 

ccttcgcacc ctgtctgcca ccggacaggc tgtgaggggg tgtctgcagc acctgcaccg 54000 

gcctgagcat cttcagagtg ggctgcagct cctggagggg tctgagagga agggaggcag 54060 

gtattttggg cgaatgagga gacagctgga gagctggcac ccttcctggc ctgcgtcctg 54120 

tgaggactct ggttggggac agcaagcttg gggtcagcct ggggcagagc ctctgggacg 1 54180 

gccccgcccc tcgtgcccct tcccctcgca gctcctgtcc tcgccccgcc ctcagctctc 54240 

cgccaggcaa ggtttggcaa gtgccgctgt gcggcagtgc ctgctgattg gctggtctgt 543 00 

tgctatggtg ctgcccaggg gtgtgctttt cctcccctgc cttccctgct atccctggga 543 60 

gtatctgggg ttgggtcatc gctggtgtgt gtgagtgtgt gtgtgtgtgt atgtgcacgt 54420 

gtgcatatgt gtgcgcttct ggcctctgca gctgagtcct ggccctcggg gggcctggca 54480 

cctcctgggg acaggcacaa agcagccatg atggagtcgg gagctggggg aggccccatt 54540 

gccccacgtg gctgccctgt gactctgggg tgcttgttag aagaggtatc tggttctgtc 54500 

tgtgtttaag caactcccta aggaattctt gtggttccag tttggggggc ctgtactgta 54660 

gaggcaaggg aggggcagga catcccccag actctgactt ctgaagcctt ctccgcccgg 54720- 

ggcctctccg ccagtacagg cagtgtcctt tgccagggct gccatgctgc agaggggagt 54780 

gggccactgt ttagcccagg aaaacctggc tctcccttag ctggaagttc tgggcctgtt 54840 

gtggttggca gggaagctga gtgacggtgc taatcacagg ggcacctgca ggggtttgtg 54900 

ggagatgcct ctgtgggttg gggcgatagg ctgaggggct gttcttccct gccctgagga 54960 

gggctgagtg tagccgccac tcctgtcctg tcttgggctg tctcggagag gatgcgtaga 55020 

accctcggga tcctgctggc ctccgtctgg tccaccctga acctcaggcc ttctgggggc 55080 

agaggaggat tccctcagga tcactcgggt gggggcctct cttgggcacc tgagaccctc 55140 

agtgggtgct ttgtggcgcg ttcacggttg gtgggggacg cccagccctg cccgccgtgt 55200 

aggagccgtt ctgtcctggg catccccctg tggtctggga cttagtggac cctgagggtg 55260 

tgtgtttacc cctgcctcac acctgcagaa aaaagcctgg agcaggagct ggccagcccc 55320 

■atcctggaca ttgaggactt ggtcaagagc ggaagcaagc acaggtgaga cccctcagtg 55380 

aggccacgac cactgtcctt ccatggccca gctctcctgt gacctgtgga ggcccggata 55440 

tatttcttca cttttctttg ttccttttta aattatgaaa ctaaccacca ttcagtacga 55500 

aaaagtttaa gcagctctga ggaagataga gtaaaaaatt gtctccctct tccctggccc 55560 

tcagccatcc ccggtggcca ccgtggagtg tggacggagc- cctgcaggcc tgtgtctgtg 55620 

cggaagcacg cgcagttttg tctgcacaga ctgtcctgca gttggctgtt ttcactcagc 556S0 
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gttgtgggta 
cagatgtctc 
cttatattga 
cccaggctgc 
agcgattttc 
tggttaattt 
ctgagctcga 
ccccatgcct 
gccgcccagg 
ttcacgccat 
cgtctggcta 
gtctcgatct 
ggagtgagcc 
gtttctatcc 
tttacatggt 
ccccatccct 
cctgggccct 
gcaccgtccc 
ccttactacc 
tacttgttgg 
ggtgtccctg 
caggcctggg 
. acttgtgagc 
ggtgggagtg 
tgacagctca 
ccaccgactc 
gtgcccggct 
ccccacagca 
actcccttcc 
ccccacactc 
tgccctgctt 
caatggtttg 
tcgctcattc 
atagcaagac 
gcaaaagagt 
ggagaccgag 
tctgagggga 
ggggccgact 
gttgtgggtg 
taacccaacc 
ctttcacgga 
gaggactcac 
gcccagctca 
acgcatgtgg 
tgcagtggca 
catcagcctc 
ttgttttttg 
ttctgaaaaa 
tcttgtttta 
catgggggag 
gcactagcca 
ctgtggaaga 
aaggacttaa 
gtaaaattct 
gacgtttcac 
gtttcgccaa 
tcgccaagat 
ccaagatggg 
agatgggtgg 
tgggtggaat 
gtggaatggc 



tagcttccca 
ctatgtctac 
gggggaaacc 
agtgcagtgt 
ccacctcagc 
ttgtattttt 
gagttccacc 
ggccagctct 
ctggagtgca 
tctcctgcct 
attttctgta 
tc.tgaccttg 
accgcgcccg 
aggaataggg 
cggaaactca 
tggtcagaaa 
acgtgcagga 
ctgcatgggg 
tgtcccggaa 
atgccaaggt 
ggcttgggaa 
tgggaggact 
agccccaggc 
aggagagacc 
ggttctctcc 
ccctcttccc 
tcacactcac 
tgtgcccggc 
gccccgtgtg 
aactcccctc 
ttgtgcccca 
tgtgttcacg 
tcggggtgta 
tggcttcact 
aagcacgtga 
ccaagcgggc 
gcctcagcag 
cctgggaagc 
actcggtgct 
aggcccctcc 
aacgcagaag 
ctcgctggtt 
ggcacaggcc 
tcattgtctt 
ccatctcggc 
ccgagtagct 
acaaacatgc 
atggtgctaa 
ccctgtataa 
gtctttgcaa 
tgaagaaaaa 
gttgagaaaa 
aaaatttttt 
cccataacta 
caagatgggt 
gatgggtgga 
gggtggaatg 
tggaatggcc 
aatggccagt 
ggccagttaa 
cagttaacca 



tgctggtgct 
ctcttacagc 
tccctccccc 
cacagtcttg 
ctcccaagta 
tgtacagaca 
tgccttggcc 
tttttttttt 
gtggtgcgat 
cagcctcccg 
tttttagtag 
tgatccgccc 
gcccagctct 
ttggttttac 
cgaggactcc 
ctcacgtgga 
tgagggctcc 
tgctcacagg 
cctgaagcag 
gggggctcag 
cagctgcccg 
cacctctgtc 
cacaggtcag 
tccaggctgt 
ttcacgtcag 
agcgcgtgcc 
tcccctcttc 
ctgacactca 
cccagcccca 
ctcccagtgt 
cactttttac 
tgacgatggc 
tatttattga 
cacacggagc 
gctgacgata 
ccaagtgcga 
gtgcagcaga 
tgtagcagaa 
ttgagccctg 
ctgggacagt 
atctaaaaca 
tcaagactcc 
tgcaggccat 
ttttttcagt 
tcactgcagc 
gggattacag 
caatttaatt 
gaacagctgg 
acctaaatgt 
ctttggggta. 
cattaaaatt 
tgaaaaagca 
cagccctcct 
agcaaacaac 
ggaatggcca 
atggccagtt 
gccagttaac 
agttaaccac 
taaccactgg 
ccactgggag 
ctgggagagc 



ggcagctcgg 
ttcagagatt 
ttttttttga 
gctcactgca 
gccgggactg 
gggtctcact 
tcccaaagtg 
tttttttttt 
ctcggctcac 
agtagctggg 
agacggggtt 
acctcggcct 
gctttttctt 
ttttccatcg 
ccatcccttg 
ctcccatcca 
■ttccgggtca 
atcttctcct 
caagccgaca 
tcctgtagct 
agcctttgct 
actgggcagg 
ttttctgagc 
ggtccatagg 
gccatgggag 
cggccccaca 
ccagtgcatg 
ctcccctcct 
cgctcactcc 
gtgcccggcc 
ttagtgcagg 
gtggtgacgt 
gagctcatca 
tttgattcta 
ctgaagggaa 
tgtcggcggg 
gcaagggaag 
ccccacagag 
gctgcccctg 
tatatcacag 
gcaagatgac 
tctaaagctg 
ggagaaggca 
tggaatctca 
ccccgcccct 
gcgtgcgcca 
gagagaggaa 
atatctgttc 
aaaagctaaa 
ggcagagatt 
tagacttcac 
gttaaagaaa 
ctgatttgaa 
ccacttaacc 
gttaaccact 
aaccactggg 
cactgggaga 
tgggagagca 
gagagcatcc 
agcatccgga 
atccggacag 



ccttgttctt 
caagttataa 
aacagggtct 
gcctcagcct 
caggcacgca 
ctgttgctca 
ctgggattac 
ttgagacgga 
tgcaagctcc 
actacaggtg 
tcaccgtgtt 
cccaaagtgc 
agtggttctg 
agtttttaaa 
gtcggaaact 
tcccaggcag 
gaagacatgg 
ctctccttcc 
tcatattcat 
gacgactcct 
gcttcagggc 
ggctcaacct 
agtctgggag 
ccagtgcccg 
gcaccgagaa 
ctcactcccc 
cccggcccca 
cccagtgtgt 
ccccgccagc 
ctgctgccct 
tgggatcaca 
t tccagatcc 
tgctgggtgc 
gtggtgggga 
atagagcaga 
aggtggggaa 
aggtgagtgg 
agctggtgag 
ggaaccatct 
ctggtaagcc 
cgtgaagaag 
caggagtgga 
gcaagctcaa 
ctctgtcacc 
aggttctagc 
ccatgcctgg 
atgaaggttg 
ggaaaacagt 
ctaaaagtta 
tcttagtatg 
caaaatttaa 
gggagaaaat 
aggacctttg 
actgggaagg 
gggagagcat 
agagcatccg 
gcatccggac 
tccggacaga 
ggacagacgt 
cagacgtttc 
acgtttcgcc 



ttgaggacag 
taaagctctt 
cgctctgcta 
cccaggctca 
ccaccatgcc 
ggccagtctc 
aggcgtgaga 
gtctcgctct 
gcctcccgag 
cccgccacca 
agccaggatg 
tgggattaca 
cgttgtgttt 
gagacgacga ■ 
cacatggact 
cagcttccca 
cggcctcggg 
cagggtgtgc 
gccgtacaat 
gatgtccagg 
cttagatcag 
ggccagacac 
cgggcaggct 
ctcttgatcc 
cacaggaagc 
ctcccagcat 
cactcactcc 
gcccagcccc 
atgtgcccgg 
cctccccatg 
cgccacgggt 
cgtcgttggt 
tattccaggc 
caggtggaca 
gggaggaggc 
tgctggtggg 
gggcggctgg 
gtttgccgtg 
ggagagcttc 
gagtctaaca 
aacagagctg 
ggtggagatg 
gctgacccac 
caggttggag 
gattctccca 
cccttggtga 
atttctggtt 
gaatcttaac 
tagaaaggaa 
gatacacaag 
agcttcaact 
acttctttca 
accagagtat 
gatctggaca 
ccggacagac 
gacagacgtt 
agacgtttcg 
cgtttcgcca 
ttcgccaaga 
gccaagatgg 
aagatgggtg 



55740 

55800 

55860 

55920 

55980 

56040 

5610O 

56160 

56220 

56280 

56340 

5640O 

56460 

56520 

56580 

56640 

56700 

56760 

56820 

56880 

56940 

57000 

57060 

57120 

57180 

.57240 

57300 

57360 

57420 

57430 

57540 

57600 

57660 

57720 

57780 

57.840 

57900 

57960 

58020 

58080 

58140 

58200 

58260 

58320 

58380 

58440 

53500 

58560 

58620 

58680 

58740 

58800 

58860 

58920 

58980 

59040 

59100 

59160 

59220 

59280 

59340 
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gaatggccag 
cggccagtta 
ccagttaacc 
gttaaccact 
gagcacatgg 
ggagccgaca 
aggatgtgga 
gtgtggcaaa 
ggctcagtgg 
aagcccagga 
aaaaagaaaa 
aaaactggga 
gaacactaac 
tctcggacat 
gttttttgtt 
cttggctcac 
agtagctggg 
gacggcagtt 
ctttggcctc 
tgtttcattc 
tatatggtgc 
aatttttttt 
tctccactca 
gagtagctgg 
atttttagta 
gtgatccgcc 
ggccaatgtg 
acactctgtg 
gaagtctcac 
tctgcctccc 
gcacacacca 
ttggccaggc 
gctgggatta 
gagcggggct 
cttcctgacc 
tggcacgtgc 
ggtggggcgg. 
gttgctcatc 
cccggcctct 
acacaacatt 
tctccgctgg 
aacctttctg 
tgagcacacc 
tggagactgg 
tcaccagacc 
gccctcggca 
ggcttgagac 
ggatcacacc 
atgttgactg 
gcggaccacg 
actaaaaata 
aggaggctga 
tcacaccact 
. caaaaagaat 
ctccttctgt 
ctcctgggct 
caccaccacg 
caggctggtc 
gggattacag 
gggccacaca 
tgggaacttg 



ttaaccactg 
accactggga 
actgggagag 
gggagagcat 
aaagtcgccc 
ctgtggtccc 
acagctggat 
cagccgagcc 
ctcacgcctg 
gtttgagacc 
aagaaaagac 
agcagcccgc 
agcagccacg 
gctaacccag 
ttgaaatgga 
tgcagcttcc 
actacaggca 
cgccatgttg 
ccaaagtgct 
catttatgtg 
acataactgc 
tttgagacgg 
ctgcaagctc 
gactacaggc 
gagacggggt 
tgcctcggcc 
tgaaaattta 
ggtgggacct 
tctgtcgcct 
agatgaaa'gc 
ccaccccctg 
tggtcttgaa 
caggcgtgag 
ggggcaggga 
gcggtggccg 
ctttccatgc 
tgttgccagg 
tgcgcttgtg 
gtgagctcca 
gacctgaagg 
cctcctaaac 
gaggggctct 
aggagcttct 
ccagctgggc 
ctttccctcc 
gcagcaggct 
tgaggcagga 
actgcattcc 
ggcgcgctag 
aggccaagag 
caaaaaaatt 
ggcaggagaa 
gtactccagc 
aattggcaat 
cgtccaggct 
caagccatcc 
cccggctaat 
ttgaacccct - 
gtgtgagcca 
gtcatgtttg 
gctgtcagcc 



ggagagcatc 
gagcatccgg 
catccggaca 
ccggacagac 
agcatctcca 
actggcatgg 
ctcatccatc 
gggagaaggg 
caatcccaga 
agcctaggca 
ttcagtgtgc 
gtgagcccat 
ggcgtggact 
agaggcccac 
gtctcgctct 
gcctcttggg 
cccgccacca 
gccaggctgg 
gaggttgcag 
acactctatt 
cagaactcag 
ggtcttgctc 
tgcctcctgg 
gcccgccacc 
ttcacagtgc 
tccgaaagtg 
aaagtaccaa 
gggagttggg 
aggctggagt 
gattctcctg 
ctaatttttg 
ctcctgacct 
ccaccgcgcc 
actgctgggt 
ggggctccca 
tgtggaggag 
tcatccaggt 
atgttcgatg 
gcgctgcgtc 
ggacagtcgt 
acctcctatt 
ggccaaactc 
gccaccccct 
cagggacctg 
agccagctcg 
tgtggtcttg 
gaatcgcttg 
agcctgggtg 
ctcatgccta 
atcaagacca 
agctgggcgt 
tcactcgaac 
ctggtgacag 
tccagtgaaa 
ggag.ttcagt 
tcccacctca 
ttt£gtattt 
gagctcaagt 
ccgcgcccgg 
gacctacttg 
tcctgtgcct 



cggacagacg 
acagacgttt 
gacgtttcgc 
gtttcaccaa 
gtcataggag 
ctgaaattca 
gctgtgaagt 
aagacgtgtt 
actttagggg 
acatagggag 
aggtttacca 
ccacaggtga 
gtggtcacac 
tgaggaggac 
gtggtgcagg 
ttcaaacagt 
cacccggcta 
tcccaaactc 
gcatgaacca 
aatagaaaag 
tacacctaaa 
tgtcacccag 
gttcacgcca 
acgcctggct 
tcgccaggct 
ctgggcttgc 
agctggaccc 
ttttgttttg 
gcagtgacac 
cctcagcctt 
tatttttagt 
cgtgatccgc 
tggctgggag 
ctgggtcttc 
gggcatcgtg 
cgtctcccgg 
ctggcctctg 
cctg.ctgcac 
ccttctcttc 
gatctttgac 
gcttctggcc 
ctgaagcccc 
cccgccctga 
cccgtcaggc 
gggtcagcct 
cctgcagtgt 
aaccttggag 
acaaagcggg 
tzaatcccagc 
tcctggccaa 
ggtggcgtgt 
ccaggaggta 
agcaagactc 
taattgtttg 
ggtatgatct 
gcctcccgag 
tttgtagagg 
gatctgccca 
cctgaaacaa 
tggcctttta 
tccgcacccc 



tttcgccaag 
cgccaagatg 
caagatgggt 
ggtggatgga 
aaggcagacc 
gaagccctga 
tgtgtagcca 
caaagattca 
ccaaggctgg 
accccatctc 
gagttttgtt 
atggacagac 
agcagcaggg 
ctactgtttt 
ctggagtgca 
tctcctgccc 
atttttgtat 
ctgaccttgt 
ccgcacccgg 
gcaggggtgg 
atgaacatct 
gctagagtgc 
ttctcctgcc 
aatttttttt 
ggtctcgatc 
aggcgtgagc 
caccccagat 
tttcgttttg 
aatctcggct 
ctgagtagct 
agagacgggg 
ccgcctcggc 
ttgggtttgt 
ctggctcctc 
gccgtctgtc 
tatggcgaac 
ctctcgacat 
atgtcttggc 
ctcctgtaga 
gaagctcaca 
tttttgtcaa 
aggtgcccag 
tccgatgcct 
gcagggcccc 
gggccagggc 
ctctgccctt 
gcagaggctg 
attctgtgtc 
actttgggag 
catagtgaaa 
gcctatagtc 
gaggttgcaa 
cgtctcaaaa 
tttgtttgtt 
tggcccactg 
tagccgggac 
cggggtttcc 
ccttggcctc 
tcgtttctaa 
cagaccccag 
caccccatt.t 



atgggtggaa 
ggtggaatgg 
ggaatggcca 
atgaccagtt 
aaagccacgg 
gtgtggcatg 
ctccacaaac 
tatgtggcca 
gggatcgctt 
aaaaaaaaaa 
tgcagttgcc 
cgtggtaccc 
agccgatgag 
ttgtgttttt 
gtggtgtggt 
cagccttccg 
tttcagtaga 
catccactca 
ctggacctac 
ggctggtggt 
taatgtgtga 
agtggtgcga 
tcagcctccc 
ttttttttgt 
tcctgacctc 
caccatgccc 
tgctcccacg 
tttttgagat 
cacattaacc 
gggattacag 
ttttaccatg 
ctcccaaagt 
aaatctccct 
tggtctgtgg 
ttgctgagcg 
tgctggttag 
cgccggcgct 
ttccctcttt 
gccgcagagc 
acgtggtgag 
gagccacgca 
gactggggac 
ctgctggggc 
cacaggccgc 
tgtctcctct 
ccggccacat 
cagtgagcca 
aaaaaaaaaa 
gctgaggtgg 
caccgtctct 
ccagctactc 
tgagccaaga 
aaaataaaat 
gagacagggt 
caacctccac 
tacaggtgca 
cagcgttgcc 
ccaaagtgct 
atattggtgt 
gccaaggctt 
ctgctttctg, 



59400 

59460 

59520 

59580 

59640 

59700 

59760 

59820 

59880 

59940 

60000 

60060 

60120 

60180 

60240 

60300 

60360 

60420 

60480 

60540 

60600 

60660 

60720 

60780 

60840 

60900 

60960 

61020 

61080 

61140 

6120O 

61260 

61320 

61380 

61440 

615-00 

61560 

61620 

61680 

61740 

6180O 

61860 

61920 

61980 

62.040 

62.100 

62160 

62220 

62280 

62340 

62400 

62460 

62520 

62580 

62640 

62700 

62760 

6282.0 

62880 

62940 

63000 
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gaacccccga tcctgtcctg ttctgtggtg attcgggtgt gcttgggctc taggagaaga 63060 

tgtgtgaaga atcggcatcc tctgacctga ctccccatga cctggcttca ggactggacg 63120 

tcatagacca ggtgctggag gagcagacca aggcagcgca gcagggtgag ccccacccgg 63180 

agttcagcgc ggactccccc agcccaggtg cgttcatagc cagaccgctt ggtcctgagg 63240 

cctgcgctgc tgcagggtga gccccacccg gagttcagca cggactcccc cagcccaggt 63300 

gcgttcatag ccaggctgct tggtcctgag gcccgtgcta ctgcagtggg cagcctgccc 633 60 

tgtggctgtg tgtggtcggc ctgggcacca tctattcagg ctggcactgc agggcatccg 63420 

cttctctcag aggcttcttg ggtgcgaatt cttcagggtc ctgtagcctg tggaagggct 63480 

ggtattgttc agtagttctg gtattttcca aagacctatg tcctctccca gccagtatca 63540' 

acttggcctc tactgtgtaa aactggaaaa ctctactttg tgaagctgag ttgggagcat 63600 

cgcttgaggc caggagtttg agaccagcct gggcaacatg gcggaacctc- gcc.cctgcca 63660 

aaaaattagc caggtgtggt ggtgtgctcc tgtggtccaa gcttttctgg aggccgaagt 63720 

gggaggcgtg cttgagcctg ggaggcagag cttccggtgc cccagatgac tccactgcac 63780 

tccagcctgg gcggcagagt gaggccatct caaaaaaaaa aaaaaggaaa actaaataca 63840 

ttcactgtaa gggcattttg catctttaaa tgacccacaa atctggcatg catcagctgc 63900 

tccgcctgta ggttccttcc cagtgtttgt ccagaggtgt atttccacac agcgctagtc 63960 

acggcatatg tggaaaacgt ggaaaccctt catggatgtt gtcagttggt ctatattttc 64020 

tttctttttt ttttttttga gatggagttt cacttttgtt gcccaggctg gagtgcaatg 64080 

gcgcgatctt ggctcactgc aacctccgcc tcctgggttc aagcaattct cctgcctcag '64140 

cctcccaagt agctgggatc acaggcgtgc accaccacgc ccagctaatt ttg.tattttt 64200 

agtagagatg gtttctccgt gttggccagg ctggtctcga actcctgacc tcacgtgatc 64260 

cacccgcttc ggcctcccaa agtgctggga ttacaggcgt gagccgccac gcccggcctt 64320 

tgtccatacc ttccacatgg cttctgtaaa cagctgacta ggagtctgtg tgaatatctt 64380 

cataggttct gccgtgacac tacttgctcg tgagcatccc caggtgtaaa cagcatcagc 64440 

ttcccccact ttcctttaaa atcgcacatg tggacggaca ccacggggac cctggaccct 64500 

ggggagcccc gtcctcaccc ttctcaccag gatggctgct tggtagagag tgagtttgca 64560 

aagttggcat ttgtttagta cagaagttat caggtgttct ggctttagaa tccctttata 64620 

tatatatata tatacatata tttaagtgac agggtctcac tctgttgccc aggctggaat 64680 

gtggtggtac aatcaaagtt ccctgtagcc tcggcctcct gggctcatgg gatctccccg 64740 

tctcagcgtc ttaaagcg'cc gggaccacag gtgtgcacca ctgccaccgg ctctcaagat 64800 

tgccacgcag ggagttgcag tgggggaagg ggttcctggg actttgaacg ctccacctcc 64860 

ctcctctcca cagtccccca accccacctc tctaacgggg tggacggccg cctctttcca 64920 

tccttcgctf ggcgcagggt ggggagagtg acaggtctcc ttccctcacc tcggcagctg 64980 

ccatttcatc gcttacataa cgtgggagaa acacccaccc acccccaggc ctgtgtgaac 65040 

atcaccacgg ggccttctcc actcttcagt tttgttagtt acttgatgtg cagggctttt 65100 

tgttgtaact agtgggggac gtgtggtggg gtgggcttct gccatctcat tcaggaccag 65160 

aacttcagtt Ctcatcccta tctgttcccc cacccccttg gagatggggt ctcactctgt 65220 

cacccaggct ggagagcggt ggtgccatca cggctcactg cagcctccac ctcctgcagc 65280 

ctccacctct tgggctcaag tgatccccct gcctcggcct cccaagctcc tgggactaca 65340 

ggcgtgtgcc actgtgcttg gcagggtcca ttcttttcct cacactttat ttattgaaga 65400 

gcccaggccg tttaccctgc agagtcggaa tctgtacagg aggggcagcc acacgagttc 65460 

cccggtttac tctgaactta ggtggcttga gggccccagt tagactgcgg ccaccgtttg 65520 

ccgggctcca gatgggacgt cctttctafc agaaggctca cagtatctcc tttcccgttt 65580 

cttcccatgt gaacattgtt gctgctgaac acctgaatat gttaatcact gggggcttgc 65640 

aagatggcag tgtgctaatt ccatcatcta gtcagttagc aggaacaact taggaccacg. 65700 

ccctgcacca tatcagctat gtggtgatcc cattcacaca ggaaaggtgg gacaaatgct 65760 

gggggtgggc cgggtgtgcC gtctcacacc tgtcatccca gcactttggg aggcccaggc 65820 

aggcggatca cgaggtcaga gactgagacc atcctggcca acacggtgaa accccgtctc 65880 

tactaaaaat acaaaaaaat tagccaggtg tggtggtgca tgcttgtaat cccagcnact 65940 

tgggaggctg aggcaggaga atcacttgaa cccaggaggc ggaggttgca gtgagccgag 66000 

atcgcaccat tgcactccag cccggcaaca gagcgagact ccgtctcaaa aatcaatcag 66060^ 

tcaatcaagt gtcatcactg aatgtttgtg tgtgaacgtg gggattggtc ctgccccatg 66120 

ctccctcctg aatctcactc ctgacctcag ttgctgcacc ttgaggtgtt ttctgtgggc 66180 

tcttgtgtcc tgaccccggc ggttgtggcc tctgctgtct gggagtcagg atttttcaca 66240 
ctcatgtccC gctccagacc tggaatcagc caagcctcca agaagccctg ctttcttttc ' 66300 

ctgcaagacg gtatttcaag acccgccgtg cggcagcggg ttggtcatgg tcactgggtt 66360 

ggtcgttgtg actgggtgtt ttcgtggaga tacagccata cgcacaggtg tgttcacaaa 66420 

tgccaattct aaaggtcaaa cacccggcca ggcataaggg ctcagcggta atcccagcac 66480 

tttgggagac. caagactggt ggatcacctg aggccaggag tttaagacca gcctgagcaa 66540 

cagggtgaaa ccccacctct accaaaaatg cgaaaattag ccgggcatgg tggcgcacac 66600 

ctatagtccc agccagccgg gagacagaca cgagaattgc ttgaacctgg gacatggagg 66660 
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ttgcagtgag 
ctcaaaaatg 
gatttggcct 
tgtggcctct 
ccttcccctg 
tggtggattt 
gcctgcacgg 
ttctgtgtgg 
ctcttccttt" 
gggttgttgg 
aacatggagc 
gcagctgggt 
atcctgctgc 
gccaccaagc 
. gagctagaaa 
caggcctgtt 
tgagatgttg 
ggcacacaca 
acagcacaca 
gaacagcaca 
cggaacagca 
cacggaacag 
cacggaacag 
tcccacgcgg 
ctggtccagt 
gatgagtgtc 
tccgtggggc 
gcagatgagg 
ggtggagcgt 
ggagccactg 
ccctcagggt 
gggactctgc 
agtgacccca 
gtgctgttct 
cctctcacct 
cCtccacagt 
gcctgttttc 
ctgccttctt 
gccttcttct 
agaccaaggg 
agtgctggca 
atcttggcag 
aaccccatct 
ctgtcccagt 
gagaaagagc 
ccgagttctg 
tgggctgctc 
gtgctgggga 
agtcccactg 
agagcaggcg 
gcaggagatg 
ggtcggggcc 
gagtgttcac 
ctcctctgtg 
agttaactga 
ggcaaactct 
gggaggacga 
acagtggggg 
ctgctggcag 
ttcctgaaca 
„ gtccctggga 



cagagatggc 
aatagataaa 
gtgtagcctc 
tccttcccct 
ttggtggatt 
ggcctgcacg 
attctgtgtg 
cctcttccct 
ccatgttggt 
cacgagcgtg 
tggaagacat 
ggggcctcct 
gcctggaggg 
cagggaggtg 
cgggccatgg 
ttcccttttt 
agaggagcag 
cccatggaac 
cactcccacg 
cacactccca 
cactctccca 
cacacacacc 
cacacacact 
ggccgctggg 
ccgtcatttg 
ccagccacca 
tccagtcttc 
agccagggct 
gttctgcctg 
tcctcagcgt 
cccctcctgt 
ccagagccac 
tccctcccct 
cgtgctggcc 
ccaacgctgg 
tgttgccttc 
ttctcggcag 
ctcctcagtt 
cctcggcagc 
ctgcatcctg 
cggggtcttt 
tcagggctcc 
tctgatcggg 
tctgcagctg 
agcccaaggc 
cttctcagag 
tggtttgaga 
gggtggcttt 
acccccttgg 
aac'tgcccgc 
gagcttggca 
tcccctgcct 
caacacggcc 
gca-tctcctt 
ttc.tagactt 
cggctccttt 
ggtctggtgg 
ccggggagcc 
ccccagcagc 
gcttctccag 
gcccatgatt 



gctgctgcac 
taaagataaa 
ttccttcgcc 
gttggtggat 
tggcctgcac 
gattctgtgt 
gcctcttcct 
cccatgttgg 
gtcctttttt 
atgcagac.tg 
tgcaaagctg 
ccttgcgagg 
ggccatcgat 
agaggcgggg 
tgcagtcctg 
gagacctggg 
cacacactcc 
agcacacaca 
gaacagcaca 
cggaacagca 
cggaacagca 
cacggaacgg 
cccacagaca 
tttcctgcag 
agcacaggtg 
cctcttttgg 
tccgcagcca 
ggggcctgtg 
agcgccgacc 
aatgctcaag 
agcttctgct 
cgcctcccag 
ctgacggcgg 
aggcctcctc 
cgteccctgg 
agttccaaag 
ctacatcttt 
cccaaagctg 
tacatctttg 
gactcgctgg 
ggtgcgggca 
cctggccgtc 
gcgtggaggc 
gcggcttcgt 
cctagctggc 
ttgtggggtc 
cgccgggccc 
tgctgcttca 
agactctgtc 
cctgaatgga 
gtcgggctga 
gcagtgtggg 
ggactgcaga 
ccctgatgga 
ggggatggga 
ccagtgccct 
cacatgccca 
gctcctgccc 
cccagctgca 
ggacagacaa 

gggggtgcga 



ccccgcctgg 
acacctgctc 
tgttggtgga 
ttggcctgca 
ggattctgtg 
ggcctcttcc 
tcccctgttg 
tggatttggc 
ccatgccagg 
cctttgctgc 
aagagtaagt 
aggtgggtga 
gctgttgagc 
agccagcccc 
ggctgtcaca 
aggagcacct 
cacgggacag 
ctcccacgaa 
cacacc.cacg 
cacacaccca 
cactctccca 
cacacactcc 
gcacacacac 
tttctcctcc 
cctgttagaa 
actccgggag 
gggcagcagg 
tgggggctct 
tggtcggggg 
gctctggcct 
gccccatcac 
aagcccctga 
cccctgctct 
tccccacctg 
agggctgaat 
ctgcagcctg 
gaccagtgtc 
cagtctggtc 
agctgtttgc 
accagatcat 
aatgtggcgt' 
acctggccgt 
gttagtgcca 
cctacagcct 
ttgtggggcc 
cagaggcttt 
agcggggtct 
acccttaggc 
cctgcctggc 
tgctgcgctc 
gcgggctcat 
tgtcagcgcc 
agctggcgga 
agccgggcgg 
gaaaggcccc 
aaacccacac 
gggtgatgct 
tgtccgggcc 
cgcagatgaa 
. atggggaccc 
ccctggcccc 



gtgacagagt 
ctcttggtgt 
tttggcctgc 
cggattctgt 
tggcctcttc 
ttcccctg.tt 
gtggatttgg 
ctgcatggat 
aatcctggtt 
ctttctcttg 
gttgccc.tcc 
cacctcctcg 
tgcctggaga 
ttcactgcag 
tcacgagtga 
gctttgcatg 
cacacagccc 
cagcacacac 
gaacggcaca 
cggaacggca 
cggaacagca 
cacggaacag 
acccacggaa 
tccaggcctt 
cgagaccttc 
gcctggaacg 
gtttgctgtc 
cctgagcgcg 
aatcccagt t 
ggctcctcgg 
tgtcactctc 
caacctcttg 
gaggcggctt 
gaggctcctg 
ttgtttccga. 
atttccccct 
gtcccccctc 
cccccgccag 
tgaagcccag 
ccagcacctg 
agggggtgca 
cagcaggaac 
cttgccagct 
catcccatta 
tctggcttct 
gcccagaggc 
ctcctctgct 
gaccatagag 
ttctctcctg 
caccctgggc 
gctggaaggg 
ctgctgccct 
cattatccag 
gtgccttctc 
tacaccacct 
tgggcctcct 
ggtgagggag. 
ctcaggccag 
gagctctgga 
tgcaggttcc 
cttctcattg 



gacaccctgt 
ctccagtttg 
acggattctg 
gtggcctctt 
cttcccctgt 
ggtggatttg 
cctgcacgga 
tctgtgtggc 
ctcaagggcg 
cccagggctg 
ccgcctcctt 
acccacagtg 
cgacagcggt 
gcccagccta 
ggcctgtttt 
atctggttgc 
cccacggaac 
actcccacga 
caeacccacg. 
cacactccca 
cacacactcc 
cagactctcc 
cagcacactc 
tccctggacc 
. ttgttaggac 
ttctgaacgc 
tgtcctgcag 
cagccgccga 
gcttccaggt 
ccaccctgca 
caaagctttg 
acgaccccct 
cttttcctcg 
agggcggagg 
. gggaaggaaa 
ccaggctcga 
aggcccgagc 
gctcgagcct 
atcacgtttc 
gcaggacgtg 
gcaggcctcc 
aggcccacag 
gccgtagagc 
ttctgctttt 
ccacaccacc 
ggtgtcccca 
gcgctcccgg 
cctcttttca 
gctgctggga 
cccccattgg 
ccggggctgg 
ccaggtgctg 
gtggggcctg 
ctgctgtatt 
gtttctgatt 
gcagggatgg 
gacgcaaagg. 
gggggaccca 
cacacgcggc 
cggcaggggt 
gccccgtcct 



66720 

66780 
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66900 

66960 

67020 

67080 

67140 

67200 

67260 
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67560 
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67920 
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68160 
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68340 

68400 

68460 

68520 

68580 

68640 

68700 

68760 
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68880 

68940 

69000 

69060 

69120 

69180 

69240 

69300 

69360 

69420 

69480 

69540 

69600 
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69720 

69780 

69840 

69900 

69960 

70020 

70080 

70140 
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gtcctgcaat 
gctttccctg 
tctgggcacg 
ctgactgggg 
tgcccacgct 
cgctatgaca 
gcagaagaac 
tcccggaaca 
cctgggcagg 
ccactgtggc 
tggggctgga 
atcccaggcc 
cgcatggagt 
tccatgtccc 
tgcagcccag 
cagcttccca 
' aatgtggctt 
ccagctatgg 
cccagctatg 
ccccagctat 
ctcccact ta 
gcccccactt 
tggaatttcc 
ccctcgggtg 
cccctgcagg 
tcacaggtga 
acagctcagc 
tagctggagg 
ccccgtgaga 
aggcacaggg 
ctggagtgca 
cctcttgcct 
atttttaaaa 
tcctgggctc 
gcttccatgc 
gccctgctca 
gggccaaatg 
gctgggaagg 
cgtgcacatg 
tgccaagtct 
tccgcatttc 
gcggcggggc 
ggctttgcgt 
tcctctccta- 
tgaggatccc 
ctctaagttc 
gacatacata 
ggtgaacaca 
ctaaaccggg 
ggcctccatg 
ttcccagcac 
tccactgggc 
tcctggtgtt 
gttgcccagg 
ttcacgccat 
cgcctggcta 
gtctcgatct 
accgtgccca 
ctcgaactcc 
ggtgtgagcc 
tagtgcatct 



gcccgtccca 
gcgtagaggt 
ttgcctctcc 
aagccaaggc 
gtacctgctg 
gcttcatgag 
cctgggatgc 
agcaggcact 
gaccaggtgg 
cgtctctgtg 
aggacagcac 
cagacccctc 
gtggttggct 
tgatgggtca 
ggcctggagg 
ggctcccact 
ctgaggttcc 
agaagactcc 
gagaagactc 
ggagaagact 
aggagaagtc 
aaggagaaga 
tttggctgct 
cctggaggat 
tgtgggcggt 
ccctgggggc 
tggaggcgaa 
caaactctgg 
ctgctgggtc 
ggatttgtgg 
ggggcttgat 
cagccacccg 
tttttttgta 
aggcagtctg 
ctgcccagca 
tggcagccac 
ctttggtaaa 
aaggaaggaa 
catgcagagt 
ctgtgttcca 
tgtgaatgac 
tctgacggcg 
tcctgttctg 
gtgcaggcat 
atatacatat 
tgaggatccc 
ttctttcctt 
tgtttccttg 
tcttgtgtat 
ccttctgacg 
gccacaccgg 
acctgccaca 
ggcctgaacc 
ctggagtgca 
tctcctgcct 
attttttgta 
cctgacctcg 
gcctgatatt 
tgacctcagg 
accacgcccg 
tggagtcagc 



tgtgaggtct 
gcttgtccgg 
ggggcctcgg 
acaggtggct 
gccacacgag 
tgtccatctg 
atggctggcc 
gc.cttctcct 
ggtgggcaca 
gccagtgacg 
tgccttgtcc 
tgtctgtggc 
cttcaagcag 
ctgcaaagca 
gcaggtgtga 
tatggagaag 
tacactcgaa 
acactctgga 
cacactctgg 
ccacactctg 
tccacactct 
ctccacactc 
gctctaagta 
gagggactgc 
gcccataggg 
agagtggggc 
ctccaggcag 
gcagtgctgg 
ctcatccctc 
tggtgttttt 
cacagctcac 
aggagctgtg 
gagatgaggt 
cccgccttgg 
gaaggctttt 
cagcgtgggg 
acacaaggcc 
cggaaggaaa 
acgcacacac 
gccgtggtgt 
tccggcttct 
gtggcttggc 
gccgtttttg 
cattcctttc 
actctctgta 
atatacatat 
gtctcatgct 
tcagattttt 
tctccaaact 
tcacctgtgg 
tggttctgtg 
cttttcctca 
aatttctttt 
gtggcgcgat 
cagcctccca 
tttttagtag 
tgatccgcct 
tttagtagaa 
cgatcctctc 
gcctcttgtt 
gtgtctactt 



gcttctggct 
tttgtggagg 
tggctttttt 
gtgtggagcc 
atcatggcag 
gcccgtgggg 
tgtgcatgct 
tcacagacgc 
gggtggtggg 
ccacagcctg 
tcccaactcc 
tcctgcctgc 
gaggccgtgc 
cctccagcac 
gctggcccgg 
tctcctccac 
ctgaatcctg 
accgaacccc 
aaccgaatcc 
gaaccggatc 
ggaaccggat 
tggaaccgaa 
gccgctggtg 
acacagtgct 
ctggtgctgg 
cagtggcccc 
ggtcaggccg 
cccgtgttgg 
tctgcctgag 
ttgagaaagg 
tgcagcctca 
aacacaggtg 
cttgccatgt 
cctcccaaag 
cgaaggaagc 
gtgaacttgt 
ctgtgtttag 
tttctctctc 
gcacgcacgc 
ctgccccccg 
tggtgttccc 
tgacatggcc 
gctcgttttc 
ccattgattt 
agttctgagg 
tctctctcta 
ggtcattttt 
gttctgagct 
gcactgtaga 
gggtgtgaaa 
cgctgaccga 
tgtttttgtt 
tgtttgtttt 
ctcggctcac 
aatacctggg 
agacgaggtt 
cccaaagtgc 
atggggtttt 
accccggcct 
cttttgaaac 
cctgtaaaaa 



ccatgcctat 
gcacgcccca 
agaagcagac 
ctgtgaggcc 
ggttaggcag 
tgcttgagct 
gctgggcatg 
agccctgagc 
gcccaggctc 
tgtcttctct 
tccccaaagg 
caagggctgc 
acctatcagg 
atggccaggc 
gcctgtccct 
actatggaac 
gaatgcggct 
ggaacgcggc 
tggaacgcgg 
ctggaacgcg 
cctggaacgt 
tcctgcacac 
gatgactcag 
cacccgcgtc 
gttgggcctg 
aggaagagga 
tgtgctcgga 
ggaaccagtt 
gccggagctg 
gtctcgcttt 
acctcctggg 
tgcaccaccg 
ttcccaggct 
tgctgggatt 
tgtttcctga 
tctgttactt 
ctgtcttgac 
cttctgtgcg 
ctgcacaaat 
gtgt.cctcta 
agcagaactc 
acattgctga 
caggaacggt 
gcagggttct 
atcccatata 
agttctgagg 
tccattttca 
tgtgcctccc 
gtgacggagc 
ggcagactct 
gcggctcggc 
eactgtggtt 
tgagacagag 
tgcaagctcc 
attataggca 
tcaccgtgtt 
tgggattaca 
gccatgttgg 
cccagagtgc 
ctgccctgac 
tcttactgtg 



ggcagcacct 
ttttgggtgc 
tcagaagtcc 
tcctctgtgc 
ggctgcccag 
gggggaggcc 
gagctgcaga 

gggggcgaga 

agccctccct 
gtgcggtagc 
cacggtgggc 
tgtgctgtcc 
cggacctgct 
gaggtagccc 
ccctggaata 
tgaatcctag 
tccaaggctt 
ctcccaggcc 
cctcccaggc 
gcctcccagc 
ggcctcccag 
tccatcggtt 
cttctgccag 
ggctcctgag 
cagccctgag 
tgtgggatgc 
agtcagggct 
gcccctgggc 
ccctgggctg 
gtcaccccgg 
cccaagtgat 
cactcagcta 
ggtctcaaac 
acaggcaaga 
ggcagactca 
ccacccccgt . 
agtgaaaatg 
tacccaggca 
ccacgcatgt 
gttcggcttc- 
cctcaagtct 
gcctgttggg 
cgtcacgcgc 
ctgtaagttc 
catattctct 
atcccatgcc 
tgacaggttt 
gaccaagatg 
tttgtgtctg 
accttgattt 
ttcccccaac 
ttgtcgtaag 
ttttgctctt 
gcctcccggg 
cctgccacca 
agccaggatg 
ggcatgagcc 
ccaggctggt 
tgggattacg 
gttttttcca 
a.ttttgacta 



70380 
70440 
70500 
70560 
70620 
70680 
70740 
70800 
70860 
70920 
7.0980 
71040 
71100 
71160 
71220 
71280 
71340 
71400 
71460 
71520 
71580 
71640 
, 71700 
71760 
71820 
71880 
71940 
72000 
72060 
72120 
72180 
72240 
72300 
72360 
72420 
72480 
72540 
72600 
72660 
72720 
72780 
72840 
72900 
72960 
73020 
73080 
73140 
73200 
73260 
73320 
73380 
73440 
73500 
73560 
73620 
73680 
73740 
73800 
73860 
73920 
73980 
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gaatgtgttg aattcctgtt ttttttttga gtcagggtcc ctctgttgcc caggctggag 74040 

tgcagtggga ccatcacagc tcactgcagc ctcaacctcc tgggctcagg ggatcctctc 74100 

agctcaacct cccaagtagc tgggaccaca ggcacatgcc accatgcccg gctaggtttt 74160 

tttttttttt tttttggtga acaccctggg gttgcaccat gttgcccagg ctggtctcga 74220 

actcctgggt tcgggcagtt tgctcctctc agcctcccgg agtgctggga ttacaggcct 74280 

gagccactgc actaggccat gttgaatttc tagattaatt tggggccctc aggggcacag 74340 

agaggagggc tgggccagtt ggcgggagga gaggcccccc gggctgccgc attttcagtg 74400 

catggagatg gcctatgttg ggggaacaca gagctcaccg ggggtccctg cagggaggag 74460 

aaagggtcag gcaggtgcca gctcctgtcc attggcctgg ggctgcatga tggcaggggc 74520 

cggtgaaccg atgacccctg ggtgtcctgt gaccttctgt gtatgcggct gatgctgcag 74580 

aaagtcgggt ggcctcaggc tcctgacggg gctgcacttc ctctgccttt cagattgtgt 74640 

ccagtgtgga cccctccgag ggcagccctg gttccccagc agggctgggg gccttacagc 74700 

cctataaggt aggggccacc tccaggaggc aggtggaggg cagcccttgt tccccggcag 747 60 

ggctgggggc cttacagtcc tataaggtgg gggccacctc caggaggcag gtggggctgg 74820 

gggtcttctg gtcctaaaag gtaaggggct gcccccagga catgggcggg gcctccacac- 74880 

tcccggtcct gtcccctcca ggtgcacatc catcctgatg ctggtcaccg gaggacggct 74940 

cagcggtctg atgcctggag caccactgca gccagaaagc gaggtacaga- cctgggccca 75000 

cacgctcccc gcccgcccgg g.tgcagtgcc cggcaccacc atgccacagg ctaggcacat 75060 

gcccagccgt ggatctcctg cccccatggg cc.tggccacc ttctccatat ccaggccaat 75120. 

ccagagcatt ctcctcactg tccctctgaa gatcggagtt actgagagac gtaggagatg ■ 75180. 

gcctgatggc accgtgacct gcccagagtc acctggttgg cggtggcaga gccacagccc 75240 

agccaggcct ccctgctggg acacgctcgt ttatgccgag gccgtcagca cagagcctcc 75300 

acagtgaggc acggctctgc ctgctgcctc cacgcagcgc ctggccgggc caagcctcag 75360 

ggtcacatct gaagggggcc cggctggccc tgttgtccga agcccctggt gcgctcagcc 7 5420 

ccgaggcccc acgtgccttc ttggcttcct gtgctccgtg gcgtcttcga gtcggtgctg 75480 

ccggggacgc tgcgtggatg gggtctgtga gcgtgccctc ggctccgtgt ccggagccct 7 5540 

gtggttcttg gggtgtatct ggccccaccc ccactgcgtg gtgtccaggg tggggcttca 75600 

cggctgcagc tgcgggagct gctgcccctg ccttgtgctc cagtggggcc ttgcctctgg 75660 

gcttggttcg tccctctctg gaacattctt tctcagctgc tgtccgaccc atggtggcat 75720 

gacgtggccc tggctgaagc agcccttgtg cggttgctgt ggttgggtct gcctggccga 75780 

gccggaaggg aagggccggg agggcgtcag ggtggcgtgg cttgaccccc gctcggtgat 75840 

ggtcctgcag caaggcctct cccagcagga agcgtccatc ccggggggag gccggcgccc 75900 

ctcacgcagt tggggttgcg ggaggcagtg cgtgcctgag gcagccggtg cacagacccc 75960 

aagggcctgg aatctgtttg ttccattgac ctctgatgtc acttgacttc tcagaagcag 76020 

ccactccctg cactgggcgt ttgtaggaaa tgagctc'ctg gaggaggggg tggggaagtt 76080 

cccccattgc agggcacact cagccccagg aaggaaacgt gcctcgtccc tgctgactcc 76140 

gaatcgcagt cagagtcgtt ctgcttgtgc cgtgttgaat tcccggcatc cggcatccag 76200 

actcagcctc- ctGcccaggc cacggccgcc gtggccagtc ggtcaagccc ttctaggaac 76260 

ttcctttgag ctggcgccct tgttcactgc tgacgccact cagaggcttg tgcacgtgtc 76320 

ctgcttccag gcagagctgg. gaactcgcac cccgtcttcc gcacgcggcc gcggaatgtc ' 76380 

gggatgccgg cgcttccttc ccgtgtgctc ttggcggggt gggcttcttg ccccgagccg 76440 

catgtcacag tttctgcaga agtttagggt tggagtgggc tgacctctct gcaggtgtcc 76500 

ccagcctctg cctggggtct gcctcctact cccaggaccc cctgtccccc agaggggccc 76560 

caagctggca' ggctcacact cagggcagcc tcctttgttc cgacttctgc acagtgggcc 76620 

tgggtgg.ctg cccgcggctc gcttgcttga tgccagtggg tggagagggt gatgggcaga 76680 

gaggcaggtg gtcaggcccc cagtcccgtc ctcacactct gtgccctctg ccgccccccg 76740 

ccccacaggg aaggtgctga' gctactggtg cttcagtccc ggccacagca tgcacgagct 76800 

ggtccgccag ggcgtccgct ccctcatcct taccagcggc acgctggccc cggtgtcctc 76860 

ctttgctctg gagatgcaga tgtacgggcc acccctgcca gggcctgagc accggtgaca 76920 

cctctgacat cagcggggtg gaagtggtgg gggtccccat gagccgggtg ccgggggtct 76980- 

cgggcctcga gggctaaagg ggtgctggtg cacttcccca ctgtctgctc cctctggcca 77040 

cgctcagccc tttc.ccagtc tgcctggaga acccacacat caccgacaag caccagatct 77100 

gggtgggggt cgtccccaga ggccccgatg gagcccagtt gagctccgcg tttgacagac 77160 

ggtgagggcc tgtccctggg ccctgctggg gtgggaggtg ggggagcact gaggcctgag 77220 

gccctgagca gtggcctctc cggctctagg ttttccgagg agtgcttatc ctccctgggg 77280 

aaggctctgg gtgagtgccc tgaatgcccc agctgtgcgc atcctggatc ctggacccct 77340 

gctcccaaga gctggtaggg acccctgcag acatcctgcc cctgccttga ccccggcccc 77400 

tgcacttcca ggcaacatcg cccgcgtggt gccctatggg" ctcctgatct tcttcccttc 77460 

ctatcctgtc atggagaaga gcctggagtt ctggcgggtg cgtctcccct gtgttctggg 77520 

cggggtgggt gagggcaggg ctggagcatg aagcaggcag tggtcacagc tcctgcttgc 77580 

cctcatcgga tcggcggcgt gaccagggct gccgtgtccc tgcctcttcc tcccacaggc 77640 
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ccgcgacttg gccaggaaga tggaggcgct gaagccgctg tttgtggagc ccaggagcaa 77700 

aggcagcttc tccgaggtcg gcacttggcc ggggctctgg gcctgctgcc ccctcgtgcc 77760 

tcccctgcct ctcacagctt ccccaaggct gaccactggc cctgaccatg ggctccggcg 77820 

gctcccgctg ccccttcagg gctcctgcgt ttccttcctg gccctgagtg ttgcctctta 77380 

tcttacaaag cccccagcac cgggtgggtg tggtaacagt ggccctcctg tctgagtagc 77940 

cctagtcggc caccctggcc ctggggttcc ccgtgttttc tgggaagcac tgagcaggcg 78000 

tggggtcagc ctgggatccg tgccaggaag aagcttccag aacccgattg gccttcctgg 78060 

ctaggacgat ccttcacctt ggagcatgag acctgggtct ccctcatggg ggaggaaggg 78120 

gctggggggg ggctccaggc tcagcctcac caactctcct tccagaccat cagtgcttac 78180 

tatgcaaggg ttgccgcccc tgggtccacc ggcgccacct tcctggcggt ctgccggggc 78240 

aaggtgagct ctccagggcc ctctgccctg acctggttgc' ctgttccctg gtgggtgctt. 78300 

atggctcccc agcagactct gggccctggg ggctgcccgg tcccctcctt gggtcccacg 78360- 

agagcgactg ctggccctgc tgggagcgtg tcctgctctg ggcctgggca ggcaggatgg 78420 

gagtttcctg gccacaagag ttggaggtgg cgtctgggag ctgtggaccc caagtggggt 78480 

cctgacccac agatggagct tcctcccacc cctggttggg gacggagcct cggggaaggt 78540 

ggctgggctg ggtgtgggca ccagggagag gagcccccac ggccccaggc agctccctgg 78600 

tgtgtcccct aggccagcga ggggctggac etctcagaca cgaatggccg tggtgtgatt 78660. 

gtcacgggcc tcccgtaccc cccacgcacg gacccccggg ttgtcctcaa gatgcagttc 78720* 

ctggatgaga tgaagggcca gggtggggct gggggccagg tgagttacag cagggtgggg 78780 

ctggggtaag gcggtctggt gactgagccc ccgccccgtg gccaagggag cccccgtgac 78840 

cgagccgccc cgccccacag ttcctccctg ggcaggagtg gtaccggcag caggcgccca 78900 

gggctgtgaa ccaggccatc gggcgagtga tccggcaccg ccaggactac ggagctgtct 78960 

tcctctgtga ccacaggtgc gtgcagtccg gtggcaggcg cggcgccagg ggacacgccc 79020 

acaccccact gggcccctgg actctcctcc cccacatgag gccccgtctc ctccagagcc 79080 

tctccggcta ctcggggtca gcgtggggcc cctgcagcag atgagggtct tcacttcggt 79140 

gaactgaacc cttgaagcgg ctgtgggcag ggcagcaggg ctatggccac cccccaggtt 79200 

cgcctttgcc gacgcaagag cccaactgcc ctcctgggtg cgtccccacg tcagggtgta 79260 

tgacaacttt ggccatgtca tccgagacgt ggcccagttc ttccgcgttg ccgagcgaac 79320 

tgtgagttcc tgcccaggga ggggacgagg gtgttgtccc cagaggagcc agaaatgggt 79380 

ccacccaccc ccatggttct gcagatgcca gcgccggccc cccgggctac agcacccagt 79440 

gtgcgtggag aagatgctgt cagcgaggcc aagtcgcccg gccccttctt ctccaccagg 79500 

aaagctaaga gtctggacct gcatgtcccc agcctgaagc agaggtcctc aggtgcggac 7 9560 

gggcagcgct gggtgggcgg cgtgggggtg gcggagcggg cggcgtgggg cgggcagcac 79620 

caggcgccca gggcggaggc gactcacctg gctttgtgcg ctccccctcc cacctccaaa 79680 

ggctgcctct ccctcctagg gcagggcccc cacgggctgc aaccctcccc tacaggcaga 79740 

gaacgcccca ggcaaggatg ccccccgagg ctgagactcc ccccaatagc agggaggaca 79800 

cccacaggca ggaccccaag tgccgggact ctcccccaag aggggccttg ccacaggcag 79860 

ggaccccagc tggggccccc cgtgggcctc actgcgcact cgggtgcccc tgcagggtca 79920 

ccagctgccg gggaccccga gagtagcctg tgtgtggagt atgagcagga gccagttcct 79980 

gcccggcaga ggcccagggg gctgctggcc . gccctggagc acagcgaaca gcgggcgggg 80040 

agccctggcg aggagcaggt acagt tccag ' gg.ccttggga tggacacaga ccctctgtct 80100 

cctgaggcca acccgacccc gcccatctgg cctcaggcac ctccccacac acccctgtaa 80160 

atcccctgcc cggcaggcag gcgggcaagc gggcggggga tcccagctgc ctggctgcct 80220 

gtgggtcctc caccccacct cacccacagg ctgctggctc ccaggtggtg catgccctgg 802.80 

ccctccgcgg gtgcccccca catcactttg gttctctggc gggtcagctt ggctcagtgc 80340 

actcaaggtc gggcgcccct' gccactggct gcgcttgagg ctggcctttc tccaggaatg 80400 

cgctgcgggt ggaacccagg ttccttcttc cttggggcct tttgccccag aagcccataa 80460 

ttcctcaggc caacccgaaa ctttctccct gcttcctgct gggagccatt cccctcttcc 80520 

cgcccatccc tgcccttcag gcccctggag tgagctccag gtgcaggcac caggcacctg 80580 

tgtccccttc ctgccagccc ctcgctgtgg tcggactgtc ttccctggac ctgctcttac 80640 

aagtcaccac ctgcgagcct catgagccgc 'tggtgtgact tggacaggac caagttgtgg 80700 

cactgtcacc ggggtgtgct gtgcccccct cccccgacct ccatcttggc tcagggctcc 80760 

ttgggaccat cttccctgtg cgcccaggcg ctttgggacc ccagagtgtg tgpttggggc 80820 

ctgtgtgtgg ttgtgagctg tgtcctcctc aggcccacag ctgctccacc ctgtccctcc 80880 

tgtctgagaa gaggccggca gaagaaccgc gaggagggag gaagaagatc cggctggtca 80940 

gccacccggt gcgtgagctg tccctgcacc tgtgccgacc accatagaca cgcatgggaa 81000 

cgcagccgtg ggtgccccca gccacggctg gtcccgatgg gaccagggaa tccaccccca 81060. 

ggagctgatg tccagggcag ctgtgatgct gacggccagg ggctcaagtg tgtggtttct 81120 

tctgcagggg gctcatgagt cccagctgga atcaggcccc acccttgggc aggtttggca 81180 

tggggcctgc agcactgggc ttggccctgg catttccctc aagtgtggac gcacacctgc 81240' 

ctcatgtgag ggacacagcc cattcctagc cttggatcaa agaacggagt tatagccgga 81300 
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gccaggaagc cccctgcctg ctggaaaacc ccaagtgtgg cggcctttgt ccatgtccct 81360 

tggcttctgg gaagaactgg gtggtgccca ggcagggctg gtgccatcag gaagtgggtg 81420 

gctgctgagg ggcctgggcc ggcgagggcc tgggtgggga gtgcctgggc cgcccctgcc 81480 

ttggtttcca cgtttccgtg ttggtctggg gtgtgtagag agatgggcac tgctcatccg 31540 

gaagcccctc cttgtgcgct gccatcctgg gagcctcagc cgcatccgct gtggggcagg 81600 

gggcttgagg gaggaggaga gagacgggcc atgcaggacc cctggcttga ggcagagcca 81660 

atctaccctt tgcccattca ctgctctcag ttccctgcca gcctctcact gtgtgacctc 81720 

agacgggccc agccccacag ctttcttccc gcagcccctc cctatgtcca tccagccagc 81780 

cagtttctca ggcagcagcc ccacctcggc agtcactgtc ccagggaacg ctcaatgttc 81340 

caaggaaggc tctgcagccc cagggaccag atgatgaggc tggccctgat ggagcctcgg 81900 

gcctgtgtcc tgcaggagga gcccgtggct ggtgcacaga cggacagggc caagctcttc 81960 

atggtggccg tgaagcagga gttgagccaa gccaactttg ccaccttcac ccaggccctg 82020 

caggactaca agggttccga tgacttcgcc gccctggccg cctgtctcgg ccccctcttt 82080 

gctgaggacc ccaagaagca caacctgctc caaggcgccc tggcttgcag aggccaccca 82140 

ccctgagggc agtgctgccg ccgcgtgtgg ggtgggggcc atctgggtcc aaggtggtct 82200 

ctgttctcta gagaaaaagg ggcagatggg gacagacgcc ccttcctcta caggcttcta 82260 

ccagt.ttgtg cggccccacc ataagcagca gtttgaggag gtctgtatcc agctgacagg 82320- 

acgaggctgt ggctatcggc ccgagcacag cattccccga aggcagcggg cacagccggt 82380 

cctggacccc actggtaaat ggggccccag gtgggaccct cagactcct'g cgtggaaggc 82440 

agtgtgggcc agagtcccgg gctgcttggg gtgggcaccc ccgggccctg cttggccccg 82500 

cctctctgtt cccctatggg agtgacgggg gcctccacct ccaccaccag caccagcagc 82 560 

accacctcca ccttcaccac caccacctcc accaccacca cctccaccac ctccacctcc 82620 

accacctcca ccacctccac cacctccacc accaccacca cctccaccac caccaccacc 82 680 

accacctcca ccaccaccac caccaccacc acctccacct ccaccacctc caccaccacc 82740 

tccacctcca ccaccaccac cacctccacc tccaccacct ccacctccac ctccaccacc 82800 

accacctcca ccaccaccac caccacctcc acctccacca gcagcagcat cacttgttgg 82860 

ggagaccctg tgcaactcca tgcacagccc tgtccctgcc atagccccga cccctaagca 82920 

cagccctgtc caactgccac acgtcccctg cctcccatgc atggtcctgg ggggtcaact 82980 

gcacacgcca gggtcctagg gtcctagacc cctgtcctcc ctgtttctgc ctctgtttgg 83040 

ggtggagtcc aagtctccag aggcggaagc atctgtgttc gtgtgttaat gaacagcccc 83100 

tacagagttc ccctagttca cccagggggg aacctagcct gttgggacga ccccagatcc 83160 

cttctgggct tggtactcac tgggatatcc tcatgcctgc acccagccta cggctctgag 83220 

ctcctgagtg gggctttggc ctgcccgcca ctg.ttccagc ccccatccag caggctggtg 83280 

tctcctctga tgcccccagc acccaggcgt gtacctgcct gggttttccc gccctggtct 83340 

gaggtgggtg aggcctggcc tccctagcca gccctgcccc cccaccccag ggaactttcc 83400 

agatgctccc gaccagcttt gtggctctac atctcttcat caggaagaac ggcgccggat 83460 

cccaagctga ccgtgtccac ggctgcagcc cagcagctgg acccccaaga gcacctgaac 83520 

cagggcaggc cccacctgtc gcccaggcca cccccaacag gtagctgact cctgaaccgt 83580 

gtgcagccta cgacttggtg ggtccctcag tggcttcacg aggctaactc ttgagtgtgg 83640 

ccggggctgc ccctgtgggg agccatctca tggtggggac tgctcccggt tctgcacccc 83700 

gcagtcgtcc tgagcagc.tc tccaggagtt cctggaggaa gggcgggcag ggcggtggga 83760 

ctctcagtcc tccaccccag cgccactctg agccatgcta ctcccacacc aggagaccct 83820 

ggcagccaac cacagtgggg gtctggagtg cccagagcag ggaagcaggg ccagcacgcc 83880 

gtgagcgcct acctggctga tgcccgcagg gccctggggt ccgcgggctg tagccaactc 83940 

ttggcagcgc tgacagccta taagcaagac gacgacctcg acaaggtgct ggctgtgttg 84000 

gccgccctga ccactgcaaa gccagaggac ttccccctgc tgcacagcaa gtggccctgg 84060 

cgtggggaac agccggtggg gtggggggca ggggacaaaa tgggggctgt gccgggtctg 84120 

attgaagctc cccgcagggt tcagcatgtt tgtgcgtcca caccacaagc agcgcttctc 84180 

acagacgtgc acagacctga ccggccggcc ctacccgggc atggagccac cgggacccca 84240 

ggaggagagg cttgccgtgc ctcctgtgct tacccacagg gctccccaac caggtagggc 84300 

acctgcctgg ctgctcctgg cagcgcccca accgcacgca gccctgggag tgagcagcaa 84360 

agccccaggc ccccctcaga ctcaagtctc tgtctccagg cccctcacgg tccgagaaga 84420 

ccgggaagac ccagagcaag atctcgtcct tccttagaca gaggccagca gggactgtgg 84480 

gggcgggcgg tgaggatgca ggtcccagcc agtcctcagg acctccccac gggcctgcag 84540 

catctgagtg gggtgagcct catgggagag acatcgctgg gcagcaggcc acgggagctc 84600 

cgggcgggcc cctctcagca ggctgtgtgt gccagggctg tggggcagag gacgtggtgc 84660 

ccttccag.tg ccctgcctgt gacttccagc gctgccaagc ctgctggcaa cggcaccttc 84720 

aggttggtgc ctggccacta cagttcctgc tgggtgtagc cccaggtgat gggctgaggg 84780 

ggaaagggca ggcccttgtc ctggtggcaa cgcctggcag acgtgtgcag tgggccggtt 84840 

gtctcacagg cctctaggat gtgcccagcc tgccacaccg cctccaggaa gcagagcgtc 84900 

atgcaggtct tctggccaga gccccagtga gtgcccacgg aggcccccag cacacccaac 84960 
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gtggcttgat cacctgcctg tccagctctg gtgggccaag aacccaccca acagaatagg 85020 

ccagcccatg ccagccggct tggcccgctg caggcctcag gcaggcgggg cccatggttg 85080 

gtccctgcgg tgggaccgga tctgggcctg cctctgagaa gccctgagct accttggggt 85140 

ctggggtggg tttctgggaa agtgcttccc cagaacttcc ctggctcctg gcctgtgagt 8520O 

ggtgccacag gggcacccca gctgagcccc tcaccgggaa ggaggagacc cccgtgggca 85260 

cgtgtccact tttaatcagg ggacagggct ctctaataaa gctgctggca gtgcccagga 85320 

cggtgtcttc gtggcctggg cttggtggtg ggagttgagg gacagggagt tggcagaggc 85380 

ccctcccagc ctgccatgtg acactgtact tcctccacgg tgggctcagc cctgccctca 85440 

tcctcacagc cgcagccaag ctgcagttgg taggggatcc accgacacac caggctgcct 85500 

gggctggtct ctgggttggg agctgcccca ggtgctgagg agggcagctc cctggctggt 85560 

gaggcccctc ccagaaccac ccttggactg agctctgggg agggatggta ccaggtgggt 85620 

gaggggggct gcctggggag ggaggggttc ctatggggcg tggcgaggct ggcccagccc 85680 

tctccccgcc catatatgta gggcagcagc aggatgggct tctggacttg ggcggcccct 85740 

ccgcaggcgg accgggggca aaggaggtgg catgtcggtc aggcacagca gggtcctgtg 85800 

tccgcgctga. gccgcgctct ccctgcfcca gcaaggacca tgagggcgct ggaggggcca 85860 

ggcctgtcgc tgctgtgcct ggtgttggcg ctgcctgccc tgctgccggt gccggctgta 85920 

cgcggagtgg cagaaacacc cacctacccc tggcgggacg cagagacagg ggagcggctg 85980 

gtgtgtgccc agtgcccccc aggcaccttt gtgcagcggc cgtgccgccg agacagcccc 36040 

acgacgtgtg gcccgtgtcc accgcgccac tacacgcagt tctggaacta cctggagcgc 86100 

tgccgctact gcaacgtcct ctgcggggag cgtgaggagg aggcacgggc ttgccacgcc 86160 

acccacaacc gcgcctgccg ctgccgcacc ggcttcttcg cgcacgctgg tttctgcttg 86220 

gagcacgcat cgtgtccacc tggtgccggc gtgattgccc cgggtgagag ctgggcgagg 86280 

ggaggggccc ccaggagtgg tggccggagg . tgtggcaggg gtcagg.ttgc tggtcccagc 86340 

cttgcaccct gagctaggac accagttccc ctgaccctgt tcttccctcc tggctgcagg 86400 

cacccccagc cagaacacgc agtgccagcc gtgcccccca ggcaccttct cagccagcag 86460 
ttccagctca gagcagtgcc agccccaccg caactgcacg gccctgggcc tggccctcaa 
tgtgccaggc tcttcctccc atgacacgct gtgcaccagc tgcactggct tccccctcag 
caccagggta ccaggtgagc cagaggcctg agggggcagc acactgcagg ccaggcccac 

ttgtgccctc actcctgccc ctgcacgtgc acctagcctg aggcatgcca gctggctctg 86700 

ggaaggggcc acagtggatt tgaggggtca ggggtccctc cactagatcc/ jrcaccaagtc 86760 

tgccctctca ggggtggctg agaatttgga tctgagccag ggcacagcct cccctgggga 86820 

gctctgggaa agtgggcagc aatctcctaa ctgcccgagg ggaaggtggc tggctcctct 86880 

gacacggaga aaccgaggcc tgatggtaac tctcctaact gcctgagagg aaggtggctg 86940 

cctcctctga catggggaaa ccgaggccca atgttaacca ctgttgagaa gtcacagggg 87000 

gaagtgaccc ccttaacatc aagtcaggtc cggtccatct gcaggtccca actcgcccct 87060 

tccgatggcc caggagcccc aagcccttgc ctgggccccc ttgcctcttg cagccaaggt 87120 
ccgagtggcc actcctgccc cctaggcctt tgctccagct ctctgaccga aggctcctgc 87180 
cccttctcca gtccccatcg ttgcactgcc ctctccagca cggctcactg cacagggatt 87240 
tctctctcct gcaaaccccc cgagtggggc ccagaaagca gggtacctgg cagcccccgc 87300 
cagtgtgtgt gggtgaaatg atcggaccgc. tgcctcccca ccccactgca ggagctgagg 87360 
agtgtgagcg tgccgtcatc gactttgtgg ctttccagga catctccatc aagaggctgc 87420 
agcggctgct gcaggccctc gaggccccgg agggctgggg tccgacacca agggcgggcc 87480 
gcgcggcctt gcagctgaag, ctgcgtcggc ggctcacgga gctcctgggg gcgcaggacg 87540 
gggcgctgct ggtgcggctg ctgcaggcgc tgcgcgtggc caggatgccc gggctggagc 87600 
ggagcgtccg tgagcgcttc ctccctgtgc actgatcctg gccccctctt atttattcta 87660 
catccttggc accccacttg cactgaaaga ggcttttttt taaatagaag aaatgaggtt 87720 
tcttaaagct tatttttaca aagctttttc ataaaactgg ttgtagttgc acagctactg 87780 
ggagggcagc cggggacacc tgagccgccc gctgtgccca gatccctcag gctgcctgcc 87840 
atcagaactg ctgcccgggg cttcccctac ctcagacaga ccctccctgg gaggatcagt 87900 
ggggagtgcc acctctgccc ccagtggctg tggcacgtgg caggggcccc tgaagctcag 87960 
cgagggtcag ggcctgggag ggtatcattg ctggaagaac aggatggggc tcaggccagc 88020 
^ cctagtcgcc ggggcccaca ctaacccccc acttatgaat tcctcccact cccaactcac 88080 
aggggatttc ccgagagggg acctgccaaa gacctcctcc aggcctccca tgcttcccgg 88140 
gaagtgaagc ttctccccct ctggggcagg ctctgaagcc tcccgatgca cccagagcaa 88200 
ccagggggct gcaccagcca ctcgcctccc cagcacggcc aggttcccgg ggctggaggt 38260 
cccccccagg tcctgggaac caacctgcag aacacacaca gggtcccctg gagaggacgc 88320 
ggggacttcc agggcccgac tcctgtgagt cacagccccg cagctgctgc gccaccccca 
ccctgactca tgccccttcc cagcagctcc tcccaggacc ccatgtcctt cccacatccg 
caggaaggga gtgcctggac tctccaggcc cacctgggga gcccctcacc tgcccaccag 
cccctgagca gcccagtaac accatcaccg tgtccaacag ccaggagcct ccaccctcca 
ggagggaagg gatggacaga gccacactcg ccgtctttat- tttgcactca ccc.tgggtga 88620 
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caccgggcag gccgctcctg cccacagcca gactgaggaa gaacacagca ctcggcaggc 88680 

ccagcggggt ccgcgcaggg aggaccccag gaccagccct actcccgagc aggggacaca 88740 

gggccccaca gagaacccct ccgggaggtt ctctcctggc tgggggaggg ctctggaccc 88800 

ccacaaacac tccccaactt gcggggctgg ggcacaaaaa cagccacccc cagcaggccc 88 860 

cctcagcttt ttgcatcagt cagctccctc -ccgggggatt agggtgaggt gaagccaggc 88920 

ccaggcgtgg ggtataggtc ttcccccgca ggcctcagcc ctgtcccgag gctgcatcac 88980 

aatccagggc ccccgctggc ctttgggaac acggcctggg tcttcctcaa ggcaagatca 89040 

gccccagacc acttccgggg ccacggggtc acagggcaga agccagatgg cagccatggc 89100 

tgacgggcct cctcctcgat ggggcggaga cagccacggg gtctcccgag ggtcccacag 89160 

ggctgtcctc atgcagccca agccagcctg agcactggag ccccaattcc caaccaggtc 89220 

tccctcagac cccccag.aaa gggcctcgaa aggccgccgc. tgcgccctgt ggaaaggctg. 89280 

ccgctgcagg g.cctgggcca gccgggctgc cagactcccc tccaaagcct ccggatgcct 89340 

acgcttttcc agacatagag gaaagtttgt cctcgagaaa acaaagtaaa tagaagaacc 89400 

ccaaagcaaa gcaaacccac cccccagatc agcagcatgg gagccaacag gaggccactc - 89460 

ctccagcacc aggggaccag ccgtcccgac ggcagcgcgg ctgcgcctac gtgatgtccc 89520 

tctgccgcgg cggccggtgc acattccgca cgacacactt caccatccac tcgatgccct 89580 

cgcgcacccc tttgctgtga agacagcggg Cgtgaggcgg ggggtctcgg tccccaaagc 89640 

ccccgcaggt. gcagccccca ctcaccctgt gagggccgag caggcctggg . tcaggcaatc 89700 

gcgcctgccg atcttgctgg tgcagtcgct gaaggccgtc ccgatgtcag ggactgagag 89760 

gcacgtctgg gggaggtaag gccgtgagga gcagccccca cgcctggccc tgtcctgccc 89820 

gtgggcccgg gactctcaga agggcgtatg cccttcaccc cagggaaaca gccagagctc 89880 

caccagggtc ccagtgtctc ccacagagac cacagcagtg aggaccctgt gctcagcccg 89940 

aggctgaaca tggctggtag tgcctgagac aaactagacg tccacacggc tccaaggagt 90000 

ccacccccca tcccctccct gggggacacc ctgagccccg aggtggggcg ctgaggactg 90060 

aggcctcctg ggcagtggcg gaggcaggtc ccaggggccc acacagccgg ggatgacgga 90120 

gaggtgggag ccctgcatca gtgatggggg cagtctgcag ccatggtggc ttctgcccac 90180 

aaccacctgc ccagtcttca aaaagcagcc ctcccctccc cctttcctcc gaggggagac 90240 

ccctgccccg taccagatgt ccctcttgtc ggctgagatt gtaggggagg ccagccttac 90300 

aggctggggg caacagagcc accccagaga aggcaggaag tgaagattca cccggccccc 90360 

cggacgccgg gctgcttctg tgcaaagcca ccccaagaga acagctagaa ctcagcgtgg 90420 

ccagtgctcc cgggggcagt ggcacctcag aggggtcttg aggggctgcc ctgggggtgg 90480 

ggctggcaca gatgccacct ccaagggtag caggaacagg taagggccag agctgacccc 90540 

caccagggcc ccagcatcac ttctttgagc tctgagtttc acctgggtgt ccccacagcc 90600 

tggccacaca ctcctgagac acggccgccc tcctggggag aggtgccctg catagcagga 90660 

agaggcccct gggcgcctgc cctgaggtgg gagaacctcc agggctggca gcagcaggtc 90720 

tggagaggaa ccaagcttgg gaagctgctg ggggcagggc aggccttgag aatggctctg 90780 

taccccctgg gcagtcactg ggcctggggt gtctgggtgc acacctactc cccttgctgt 90840 

gggggaggct ggggactcgg gaagctgctg cgggaggcag gggtggggct cacctccaca 90900 

"tcctgcttgt tggccagcac caagacgggg^ acaccgcaca gcgcctcgct ggtcaccacc ..' 90960 

tcctctgggg agggcaggag aggcagcgcc tcacacccag catcctgcct ctgactgccc 91020 

aggggcccac aggcgtggac actgtgacag ccactccctc tgcccccccc ccgtcaccca 91080- 

ctaggcagga gcacttctga ccagacactg agcctgcccc aggcacagag ctgcccaagc 91140 

tggacctgcc cccactcacc acccatccct cccagagcag ccaggccgca. ctcaccaaac 91200 

gcctgcttgg actcagccag cctctcctcg ccggtggagt caacgacgta gatgacgccg 91260 

tgacactccg cataatactg ggaggaagca ccaggagttg gggctcagtc cccaccctgc 91320 

caagggccag cagagccagg cctgtgtcat ggccacagtg aggggctcac atgaggaagg 91380 

ggcaagaggg cagcccccaa ctgcaagacc cttctgggat gcattctggg gttgcgggga 91440 

gacctggtgg aggtgtcccc agacgctgct cctgagaacc tgccggcaac ctttggcctg 91500 

atggtggcca aaggcgaaag acagggattg ggccaggcgt ggtggctcac acttactatc 91560 

ccaacacttt gggaggcaga agcaggagga tcacctgagc ccacttcacg gccaacccgg 91620 

gcaacacagt gagactccgt ccgtacaaaa gcccacggta atgtgcgcct gcagtcctag 91680 

ccactcggga ggctgaggtg ggaggatggc ttgagcctgg gaggttgagg ctgtagtgag 91740 

ctctgatcac accactgcac tccagcctgg gtgagaatga gagaccctgt ctcaaaaaaa 91800 

agatagggtt Cgggggctgg aggaacctag accacagcct ggcccgttga gggagtgcac ■ 91360 

ctgtggggct ctgtgccagc acctcgcaca gggagggagt gcggccatgc ggataagact 91920 

gaccagcacc atctacgaag cgagccttcc ctgccaggac agggccagag tcactgagct 91980 

cagacccctg cagcctgggc tggtcagtcc tgggctcgct ggcaacactc ctgggcaaga 92040 

cagggcacag cccctgcagc ctcaggtaca agtgctgagc cctggaccag atgagtgcac 92100 

ctctatctca atcagaaaaa aacacagcaa actccgcgtc cacgtggagc agacaacagc 92160 

tcacatctgc cactttgcct ccaggctgtg ccagctctcc tgtccaggca cgagtgccca 92220 

gagacctaga actggatgct gaccaggtag gacaagctgg tggtcagtgt gttaagacac 92280 
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acacacccga gagcatgaga agccaggagg cacagcccaa ctctccgaaa tccttagggt 92340 

gtctgagcag ggagtaccag acaaccccat cccagtgcca gacaagcttg tgcacctgca 92400 

cttcccacag aggagagaag cctgtgcacc tgcacttccc acagtggaaa ggaggaggcc 92460 

caaggccagg cccccccacc cccaggaact tcccacagtg gagaggaggc ccaaggccag 92520 

gcgccctcca gggttctgca ggtagcgagg cccccccacc cccaggaact tctctggcct 92580 

acagacaggt cccacacaga ggccgccaac ccctcaaggg accctgcagt gtgccggctg 92640 

tctgctgctg acacaaggga gcaggcggac cctaaggtgg. agacctctgt ggcaggaggg 92700 

gcggc.tctgc ggaggctgca gcaagcccag tgagagaatc ■ tccacgtggc tcctggggct 92760 

Cctgagcagg gtggcagaag gt.tcatgtgc aaccgggtcc tggaccatgg gaccacgtgg 92820 

ccagagccac ccatcacacc taccaggcac aaggtgcaca gcccagcagg gccgcagtgg 92880- 

acgggagcga cacctcaggg ctgagtgcgg gcaggaccca gagccccacg ccccagtgga 92940 

ggcgtcacag cagtggtcat tgtggggtgc cccacaagga gggggaagag ggaggtgccc 93000 

cagcgtggct cctggctggc cagctgaccc cagtggagca gtcagaggga ctgtgggtct. 93060 

gagtttttct ccccagcagc aatgggagct ccccaactgc aaagtgccag ccagcctgag 93120 

agactag.tgt tacagcaaag aacccaggag ctgaggtcct ggcacatgcc acacatgtgg 93180 

acaccaaccc agggtccagc cccaggacga ggccaattcg caatgacgcc cctttctgtg 93240 

gtgctggctc . tg.cacaagga cgcaggatac aggaaccagg gtgggagcag gggcctccct 93300 

.tccggtccct cccagtgacc taggggggtc cctgcagctg atcctcccag ctctgagctc 93360 

agcagggtca ggggtcccgg ccaccagagc agcacacact cagcagacac gctgaatgac 93420 

gagccacagc tgcctcatgg gcatgacttg cacctcatgt ctaggagacc ctggtgggca 93480 

ggagatgggg ctgccatccc acagctgtcc cacagctggg gacccaggga gccactggcc 93540 

ccaccacggt ggtgtctgga gaagggctca gactgccagg aagtcgcacc ccagcagaag 93600 

tggtagtgaa ttgggagggc actcaaggaa gggctgtgca gccccaagac cagcagcaag 93 660 

gatgggctac agcggccccc Ctaagtctcc ctcttccagt ttcgcctcaa gagaggccct 93720 

caggaccttg gaggaacccc tctccaacgt ggaagtgtgg gtccacatag ggctgcagct 93780 

gtggccagtg caggcatctc cggccccact gtactcttgc ttcacgttgg agaacactgc 93840 

accagcagat ggtctcattt tggtttctgt gggacccact ttggctgcaa agagccacac 93900 

tgccaggtca cacctgccca gggcagccca cactggggac ccaccaggcc atggtgtgaa 93960 

gt.cccggcca gcctggcccc acatggcaca gcatagccag ttctcctcca gggctccctg. 94020 

ccgggccaac cacagctctg cggatcctgc tgcctgagtc gacctctcct ctcccgtcct 94080 

ccctgccttc ctggtgccga cccccagtgt gcatcctgta cctcgacctg tctcagcatc 94140 

tgtgcctgag. acaccggccc gcgacaagat catcatcatc tgtgtcaccc cccaagcatg 94200 

ctgcgcactg gacacacagg ccctgactca acttgtcctg tctgact.tca gtggtcctac 94260 

aggatctatc agagatcact tggccatggg agaaatgtct tcttggctag aagtcacagc 94320 

aggaggggac actttggggg cgcctaggaa aggggaacta ggatcaaaaa agagatcagg 94380 

acctgggcac tcagctctag agatggcatc agggcagcca aggcactggg gacaccccac 94440 

acccactgtg ccagcctagg gcagggagcc cgaggaagcc acaggctctg ccctgctcag 94500 

tgctggactc agtgcctggc ccaggctgag aaggagataa actgcagcct tgggggtgtg. 94560 

gggaaggggc" accacactgg gatctcagaa atgcccaaaa cctgtgtcaa aataggagac 94620 

tgccgctgtg agaccctgag gagtcttctg gtgatcatgg aagaacaaat gttaagctag 94680 

aactgaagga acctcatcag . gggagaggca gccatcctgc cgtccccaca tctggtcttt 94740 

gccatttctg Cgtcctgtgg tggtcagcag caaggtctct gagccgaaag gaggcactca. 94800 

ctttggagga gtgcagggtc cccaggtccc cacactttgt cttgtcctga ctgagaaaga 94860 

aacagactgc cctgacctct ctgacttggc cagcgaggtt gcccttaggc tcaaacccaa 94920 

gccagggttc gaacattccc agacacttgt aagatgttta ggttgttaac ataatgttca 94980 

ggtttcaaaa cattgaaaga aactagcccc agcccngaac ccagatcccc cccggcttca 95040 

ggcatgacca gtgaacacgc ccttctctca ccggtcacct gaggatgccg cactctgtca 95100 

acaggttccc ctaatacatg ctccgatctg atcgccttgg catttagtga ttctttccct 95160 

ggaattctcc actggcccca tcgcagggaa ctcccaagtg ggaaactccc ctaccaccac 95220 

ctccggggca acttcagcta agggttcagc tgggacaaaa cagggagcca ctcgggaacc 95280 

tgggacagga ccagagagaa aacccgaggg acagagtggg taaggaaagc tgctgaggaa 95340 

• gggcccaaag ggcactctgg aaagaagtgg cactggaggg ctggggtggg ggtggtcctg 95400 

gccagggagt cttaccttgt cccacaaaga ctgcagctct tcctgccctc ctaagtccca 95460 

gaacatgagc cgagcctttc ccacatccac agcgccgact ggggagagga ggaaacaggc 95520 

aaggctcatg accttggtcc tcgacacacc cagtcccagc tctcccaggg gatggggcaa 95580 

accatgctgg tgccactcaa atgagacttg agaggggccc gacagggctg tggccacggg 95640 

ccagctggac tgtgaatatc acggcatcct caaggcccca aacccacagc ctgctattga 95700 

gacccttact gtttaggccc acggtggtgg tgattttgga tagactcatc cccttgtagt 95760 

tcttgttaaa tcgggttttc gactgctcca ggaaggtctg aggagagagg cagaggcgaa 95820 

acacatcaag gaggggctat actggcttcc aaatatcctt actcaggtct gttctttaaa 95880 

agacagaaac agaaacagag caacactctg cccttcagga ggctggtggt gactatcctg 95940 
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ccgtctcagg tgaaatttgg cttccgtctg ggtagtgaac gtgcagctga cagcacaaaa 96000 

ccgaaggggg cgccgccagg ccgtgggaaa ggtgcgcgca agggcgtggg cacccaccgt 96060 

cttcccagca ttgtccaggc ccaggatcag gacgcagtac ccgtccttct gaaacatgta 96120 

cttgtacaag cccgacagca gcgtgtacat cctgccctgg gcaccccaac ataggtcagt 96180 

gtgcagccag aaagcacctc ccctcccccg ggcttctcca cggtggccag tggcgcccca 96240 

cgtccagccg accgctcagg acgagagcct gggggccatt cccgactcct cgtccctctc 96300 

ccaccccgtc cctctgtaac ttctcccagg tcagccgcca ctgtgtcctg ctcacagcaa 96360 

tgactgcgac ctctccgcat acacatcggt tccggcccct cccctgctcg cgggactacc 96420 

cagccgggtg ttcacagtga gctcagccgc gctcccgccc tcccccgagg cttcgctccc 96480 

acgcttcacg cgcgcggaac ggggaacaca ctcgctgcag ccccgcctgg gccacggcac 96540 

cctcgagcgc cagccccgcg ccccacccgg gagcagcgag ccaccggcgc gctccccagg 96600 

agcccctgca ggcgccgggt agggacgccc catcacccca tttcttaaaa cggggacggc 96660 

.cctgggggga gcggactaca gggcgggtga gcagcggcgc ggctgctcct ggagtg.cacc 96720 

tggaggcggc gcgcggctgg eagggaacga ctgcgaagga agaacctggg tcgcggcccc 96730 

cggctacgtc cgccccaagc cgccgccgcc aggtctgagg ctccccgaca agcagccaaa 96840 

gctggctcct gtcacacccg cgtcccacct cgagtcctgg gccgcccctc gggcctcgcg 96900 

cctcaccgca cagcctgcgg cctacctgcg tccgccgcgc cctcggagcc gctgctgctg 96960 

acccccgctg acctccgctg accccgcgct aaccccgcgc ggcgcctgac gggacgcggg 97020 

ccggcctcag ggaatgagct gaaccgcgtc ccagcggcct ccgcgctccg cttcccggct 97080 

gcccccgcgc gccaagcact tccggaagcg gcggcgctcg ggaggaagtg ccgatcggct 97140 

gctggggcga aaagggggcg ccgggccgct ctagccggtg aggccggcgg gctctctgtg 97200 

gctgcggctg ggaaaccgcg cggaggaggt gcccggccgg ggaccaggtg gccgcggttt 97260 

gcggggacgc ggccctggcc agacagaaga gacgccgggc gggggggcgc ggccggcctg 97320 

gaaggcggcg ggcgcggcgg gtgggctcgg cggagggtga ggcggcgggg cgccccgcgg 97380 

ggaaggggct ccggagtgac gcgggacccg gctagcggcg agcccacggc ggctcggaag 97440 

ggaagcgcgg agcctgagcg ggggtacccg ggctgcgacc; tctgcgctgg gagctgtgcc 97 500 

cctgagccgg tgtctccccg agggaaaggg gacgtgcccg tgcccgtgcc cgccctcagg 97560 

ctgtggggtc ggtcccgaga cgcggggctc agctggcttc tcttcttgca gccctggtcc 97620 

agcgcctccc tctctcagca tggacgagga gagcctggag tcggccttgc agacctaccg 97 680 

tgcgcagctg cagcaggtgg agctggcctt gggcgccggc ctggattcgt ctgagcaggc 97740 

tgacctgcgc cagctgcagg gggacctgaa .ggagctcatc gagctcaccg aggccagcct 97800 

ggtgtctgtc aggaagagca ggttgttggc cgcgctggac gaagagcgcc cgggccgcca 97 860 

ggaagatgct gagtaccagg ctttccggga ggccatcact gaggcggtgg aggcaccagc 97920 

agcggcccgt gggtccggat 'cagagaccgt ccctaaagca gaggcggggc cagaatctgc 97980 

ggcaggtggg caggaggagg aagagggaga ggacgaggaa gagctgagtg ggacaaaggt 98040 

gagcgcgccc tactacagct cctggggcac tctggagtat cacaacgcca tggtggtggg 98100 

aacggaagag gcggaggatg gctcggcggg tgtccgtgtg ctttacctgt accccactca 98160 

caagtctctg aagccgtgcc cg.ttcttcct ggagggaaag tgccgcttta aggagaactg 98220 

caggtaaagc cctttgttgt cagatgccaa ccttaggggc gtaaggggca cgcacacagg 98280 

gtcgggtcag gatcggccct ccctttgctt tgcagttttg tctcagcttc ctggggcagg 98340 

cgtgctttga cagctgtgtc tgtgttcagg cgtctacgtc ctcctcctgg ggtgaatcaa 98400 

gaagcatgga aggaggccag gcgcggtggc tcacgcctgt aatcccagca ctttaggaag 98460 

ccgaggcggg cagatcacct gaggtcagga gttcaagacc acgctggtca acatggtgaa 98520 

accccatctc cttaaaaaca caaaaacgaa ccggtcgtgg tggcgcgcac ctgtggtcct 98580 

ggctactcag gaggctgagg caggagaatt ggttgaaccc aggaggccga gtttgcagtg 98640 

agtggagatg cagccactgt actgcagccc gagcagcagt gcaaggctta tgtggaagag 98700 

agtaggtctc cagcctatcg tcagtttttt tttggtggtt gttttaattc tttttgagac 98760 

agggtcttac tttgtcaacc aggctggagt gcagtggcat agtcctggct cactgcagcc 98820 

tggacctccn gggctcaacc gatcctcctg cctcagcccc cctaggagct gggctacaga 98880 

ctcacgctac tacacccagc taatttttat attactataa ttttttatct tttttttgag 98940 

acgg.ag.tctt gttctgttgc ccaggctgga gtgcagtggc gtgatctcgg ctcactgcaa 99000 

gctccgcctc ccgggttcac gccattctcc tgcctcagcc tcccgagtag ctgggactac 99060 

aggcgcccgc caccatgtct ggctaatttt ctgtattttt agtagag'acg gggtttcacc 99120 

atgttagcca ggatggtctc aatctcctga cctcgtgatc cgcccacctt ggcctcccaa 99180 

agtgctggga tgacaagcgt gagccaccgc gcctggcctt ttttttttgg agacagagtt 99240 

ccactctcct cacccaggct ggagtgtagt- ggcgcaatct cagcttaccg caacctctgt .99300 

ctcccgggtt gaagtaattc tctacctcag cgtccagagt agctggcact acaggcgccc 993:60 

gccaccacac tcggctaatt ttttgtattt ttagtagagt cggagattca ccatettggc 99420 

caggctggtc ttgaacccct gacctcgtga tccacccacc ttggcccccc aaagtgctgg . 99480 

gatcacaggc gtgagccact gcgcctggcc ctgttgttag ttttattctc tagagttcaa 99540 

cttttaaatt. ttactttcat ggagattttc aaacataccc caaattagag agtttagcat 99600 
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aatcaccgcc 
tttgtctgtt 
aagttactgg 
aggaaattac 
gtaacttctg 
atgaacaatt 
tttttacttt 
tagccaatag 
gtcttccttt 
gcagtggcac 
■cctcagcctc 
tattttttgt 
gatgagagat 
ttcatatact 
ctgacagagg 
ggctccagtt 
tcggcatcag 
attcaggact 
. agtttcactg 
agggattctc 
gtgcccagct 
gtctcgaacc 
gcatgagcca 
aagtatatag 
tgtectttga 
gcctatgttt 
cctatgtttt 
tttgagttaa 
atttccagtt 
tggcactttt 
tcctgttcca 
ataagtttta 
ttttgtttct 
cataaaagac 
acattgtcat 
cccagtactt 
ctggccaacg 
gcgcacacct 
gaggaggagg 
gagactccat 
■ tcatgaatga 
actgttcaag 
aattgttttt 
agtgcagtgg 
ctgctcagcc 
tttttttgag. 
ctcactgcaa 
ctgggaatac 
gggtttcacc 
cggcctccca 
ccttagttac 
ggctcacgcc 
ggagttcgag 
ttagtctggc 
aattgcttga 
gcttggggaa 
tttttatgtt 
atgcactatt 
tatatgtgaa 
ttcttgctta 
cgggccgggg 



cacggtccat 
ctccctgccc 
ttttttccaa 
ttaaaaagta 
tgtcataata 
attactgtaa 
tctcttagaa 
ttcaggctga 
tttttttttt 
aatctcggct 
cctaatagct 
agagacggga 
ctgacacatc 
ttcattcatc 
aggaaacagg' 
gcccccagcc 
gcctgatctg 
gacaagttcc 
ttgctgccca 
ctgcctcagc 
aacttttaca 
cttgacctca 
ccacacccgg 
tttgcagata 
tgcacaaaca 
ttgtcatcag 
cttccaagag 
tttttgtata 
ttcccagcac 
gtccaaaatc 
ttggtgtata 
aactcaggaa 
ggatccttcg 
agtttgaatt 
ctttacaata 
tgggaggctg 
tggtgaaacc 
gtaatcccag. 
ttgcagtgag 
cttaaacaac 
aggatctgtc 
tcttgcacat 
ctcttccttt 
cacaatcttg 
tcccaagtag 
atggag-tctt 
gctccacctc 
aggtgcctgc 
atgtagccag 
aagtgctggg 
attttcaggc 
tgtaatccca 
accagcctgg 
atggtggcag 
acctgggagg 
caagagtgag 
aatcctgtat 
taggattttc 
tagagataat 
actgccttag 
gtggtggctc 



catccaatgt 
tatttttttc 
ttatgaatac 
actttttgga 
ggtaacacat 
tgtctctatt 
gaaatattta 
ccgtgtaacc 
ttgaaacagc 
cactgcaatc 
gatactacaa 
tttcaccata 
cctgtggtgc 
tgtgcaacag 
agtagaagaa 
ctcgttccgt 
aaagcttccg 
cttggatttg 
ggctagagta 
ctcccaagta 
tttttagtag 
ggtgatcccg 
cccaggatt t 
tttgctccca 
ttttaaattt 
gtttaagaaa 
ttctgtagtt 
tgttattagg 
catttgttta 
aattggcaat 
tgtgccagta 
gtggtagtta 
caatttcata 
ttgatatgga 
ttaagccttc 
aggcgggcat 
ccgtctctac 
ctacaggaga 
ctgagatcat 
aacaataaca 
catttattta 
tcttggttaa 
ttttttttga 
gctcactgca 
■ ctgggatcac 
gctctgttgc 
ccgggttcac 
caccacgccc 
gatggtctcg 
attacagatg 
tgtttgttgc 
gcactttggg 
ccaacatggt 
gtgcctgtaa 
cggaggttgc 
acttcatctc 
tgattgctga 
gatatataca 
tgtagtcttt 
ctagaacttc 
acacctgtgt 



cgtcatttat 
tttcctgggc 
ttcatagttg 
agtccaagtc 
ttaatggtaa 
tccctaagcg 
ccaagccttc 
atccctagtt 
gtcttgctgc 
tccgcctccc 
gtgcgcactg 
ttacccaggt 
tccctctgga 
ccctgtaggt 
ctgagtggtc 
cccctgctcc 
gagcatctta 
cgttg.gacct 
caatggcacg 
gctgggatta 
aggcagggtt 
ccttggtttc 
ctttatatat 
ttgtttgggt 
tgatgcagtg 
ccacctcatc 
ttagctgtta 
tgagggtcca 
aaagactgct 
atatgtaagg 
ccacactgt t 
ttcaccattt 
tgaattttag 
ttgcatagaa 
tggctgggtg 
atcacttgag 
taaaaataaa 
gggtgaggca 
gccactgcac 
gaagaaaaaa 
cgtctttaat 
ataagtatta 
gacagagtct 
acctctgcct 
aggtgcctgc 
ccaggctgga 
accattcttc 
agctaatttt 
aactcttgac 
tgagccactg 
tagtatatag 
aggccaaggc 
gaaatccagc 
tcccatctac 
agtgagctga 
aaaaaaaaaa 
attggtttat 
atcatatata 
gtttctagtt 
aagtacgatg 
tcccagcact 



taatattttc 

cattttaaag 

catctctaat 

cgatgtgagg 

tacctct tec 

catagcttta 

tagtaggtaa 

ctagttctag 

tctgtcaccc 

tggcccaagc 

ccacgcccag 

ctcgaattcc 

ccaggcactg 

aggccctgca 

cagggcttca 

acccagtgct 

cagacgtcca 

tttttttttt 

acctccacct 

caggcgcctg 

tcaccgtgtt 

ecaaagtget 

tctggatatc 

tgtcttttca 

caatttattg 

catagttatg 

aatttaggtc 

ctttattctt 

ttttctccac 

gtttatttct 

ctgattatta 

gctcctcttt 

gateggcttg 

tgtgtagatc 

tggtggctga 

gtcaggagtt 

aaacaaatta 

ggagaatege 

tccagcctgg 

acagtattaa 

ttctttcaac 

tttttgatgc 

tgctctgtca 

cccgggttca 

caccacaccc 

gtgaagtggc 

cgcctcagcc 

ttgtattttt 

ctcaggtgat 

tgcccggctc 

aaatacaagc 

gggtggatca 

ctctattaaa 

tcaggaggct 

gatcgcgcca 

aaaaagaaat 

tagtgctaat 

ttcaatatat 

tgcatggcat 

ttgaataaaa 

ttgggaggtg 



ccagtctcat 

caaattccag 

ctaactgatt 

acaaaaaaga 

atattcaaat 

tttttcctcc 

ttttcttttt 

ttctttgaat 

aggctggagt 

catcctccca 

ctaatttttg 

tgatcccttt 

ctccaagggt 

gtcacaccat 

aggctcagag 

gettgecatg 

ccttgccacc 

tttaagatgg 

cctgggttca 

tcaccacgtg 

ggccaggctg 

gggattacag 

atcccttatg 

cttgatatag 

tttctttatt 

aggattttca 

tttgatccat 

ttgcatgtgg 

tgaatggtct 

gagctctctc 

tagctttgtg 

ttcaagtttg 

tccaattctg 

tgtttggggc 

cgcctgtaat 

caagaccagc 

gtcggaggtg 

ttgaacctgg 

gtaacagagg 

gtcttccaat 

agtattttgt 

ttctctaagg. 

cccaggctgg 

agcaattctt 

agctaatttt 

ccaatcttgg 

tcctgagtcg 

agtagagatg 

ctgcctgcct 

gagttgtttt 

tgggcaccgt 

cctgtggtca 

aatacaaaaa 

gaggcaagag 

ttgcactcca 

acagtggatt 

aggatttttt 

acaattaata 

ttatttcttt 

gtgactagag 

gaagtgggca 



99660 
99720 
99780 
99840 
99900 
99960 
100020 
100080 
100140 
100200 
100260 
100320 
100380 
100440 
100500 
100560 
100620 
100680 
100740 
100800' 
100860 
100920 
100980 
101040 
101100 
101160 
101220 
101280 
101340 
101400 
101460 
101520 
101580 
101640 
101700 
101760 
101.820 
101880 
101940 
102000 
102060 
102120 
102130 
102240 
102300 
102360 
102420 
102480 
102540 
102600 
102660 
102720 
102780 
102840 
102900 
102960' 
103020 
103080 
103140 
103200 
103260 
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gatcacttga 
ctaaaaatac 
agggtgagac 
tgccactcca 
aaaagtggct 
ttcctgatat 
gttttctgta 
gtgtttttta 
cccaggctgg 
ggggatcctc 
cagctaattt 
taaactccag 
gagtcactgc 
tcatgtgtgg- 
accatccttg 
tatttttttt 
ctaaagtgat 
ggccctatct 
ccgggctgga 
caattctcct 
gctaattttt 
cactcctgac 
agccaccgca 
aacattgctg 
gggatattgg 
taatattggc 
agggattggt 
acaatcatat 
attgactatt 
ataactgtag 
tctatttgtt 
ctgtcgccca 
agttcaagtg 
catacctggc 
ggtctcgaac 
acagacacaa 
taggaatttg 
tccttttata 
ggttttagca 
cactgtgtca. 
cccaggttca 
ccacacccag 
ccaggctgga 
caattctcct 
gctgattttt 
actccctacc 
gagccaccat 
ccaggctggt 
taggattaca 
aaaggtttgt 
tttcatttat 
ctcttttttt. 
tttcttttct 
tgtcacccag 
gcgattctcc 
atgtaagcat 
attgtttttt 
ccgtttggag 
gtcctcctgc 
taatttcttt 
cactttggga 



gatcagcagt 
aaaaattagc 
atgagaattg 
ctccagcctg 
agaacaaaca 
taggtggaaa 
gattcccttt 
tcatgaaagg 
agggcagtgg 
ctacttcatc 
tttaattttt 
ggctcaagca 
actgcaccca. 
tttttttcct 
aattcctcag 
gtagagatga 
cctcctgcct 
tttttctttt 
gtgagtggcg 
gcctcagcct 
tgtgttttta 
ctcgtgattc 
gcgagcctta 
gatttgattt 
tctgtagttt 
ctcatagaat 
gttaattctt 
atatagagag 
gattcaattt 
tgaaagtttc 
ataggtctat 
ggctggagtg 
attctcccac 
taatttttgt 
tcctgacctc 
gccaccgcgt 
tccattttgt 
gccctagtta 
atttgagtca 
cccaggctgg 
agcaattctg 
ctagttttgt 
gtgcagtggt 
gcctcagcct 
atatttttag 
tgaggtgatc 
gcccagccag 
cttgaaatcc 
ggcatgagcc 
caattttgtt 
ttccatttta 
. cccctaattt 
tttttttttt 
actggagtgc 
tgcctcagcc 
ttacagctac 
gttgtttttt 
agcagcgatg 
ctcagcctcc 
tataagtttt 
ggccaaggtg 



ttgagaccag 
tgggtgaggt 
cttgaacctg 
gacgacagag 
tctttatctt 
acttttagtc 
atcgagttga 
gtgttgattt 
catgatcatg 
ctcctgagta 
ctgtagaggt 
atcc.tgcctc 
gctgtgtggg 
ttcattttgt 
ataaagcacg 
gatttcactc 
cagcctccga 
tctttttttt 
cgatctcggc 
cctgagtagc 
gttgagacgg 
acccaccttg 
tctttttaac 
gctgctgttt 
tcttttcttg 
gagttaggaa 
cttcaaatgt 
agagagagag 
ccttattgaa 
tgaacttttc 
tcagattttc 
cagtggtgcc 
ctcagcctcc 
atttttagta 
aggtgatcca 
ccagccttga 
ttatgttatc 
tttctgtaag 
tctcttttct 
agtgcagtgg 
ccttagcctc 
tttgtttttt 
gcaatctcac 
cccgagtagc 
tagagatggg 
cgcccacctt 
tttttgtatt 
tgacctcagg 
accacgcatg 
gatctttttt 
acctttgttt 
ttcaaggtat 
tttttttttt 
agtggcatga 
tcctgagtag 
agatttccct 
gttgttgttt 
cgatcatagc 
caagtagctg 
tgcagaggcc 
ggtggatcac 



cctggccaac 

gatgtgcacc 

gggggcggag 

caagaaccct 

gttcctgatc 

tttcactgtt 

ggaaattctc 

tttttttaaa 

gctcactgca 

ggtgagacta 

agggtcctgc 

agcctcccaa 

atttttcaaa 

taatgtggta 

catattcatg 

tgttgcccaa 

aagcgctggg 

ttttttttga 

tcactgcaac 

tgggactaca 

tgttttgcca 

gcctcccgaa 

agttaaaagt 

tgttgagaat 

gcatgtct tt 

gtattcttta 

ttgatagaat 

agagatggac 

attgactttt 

tttcattgga 

tgtttcttct 

atcttggctc 

ccagtatctc 

ggaacagcat 

cccgcctcgg 

gtcagtttag 

taatctgtta 

atcagtagta 

tcttcttttt 

catgatcttg- 

ctgagtagct 

tgtttttgag 

tcattgcaac 

tggaactata 

atttcaccat 

ggcctcccaa 

tttagtagag 

ttatccacca 

gcctgtcttt 

gttgctgatc 

ccttttttct 

acagttaagt 

tttggttgct 

tctcagctca 

acgtttcccg 

ctaaacactg 

tgttgtttga 

tctgtagcct 

ggactacagg 

aggcacagtg 

ctaaggtcag 



acggcgaaac 

tgtagtccca 

gttgcagtga 

gtctttaaaa 

ttaggtggaa 

gaatatgatg 

ttatattcat 

gatagggtct 

acctcgaatt 

caggcatgag 

tttgctgccc 

agtgctgaga 

tgcttttttc 

tattgatttt 

gcgtattatc 

gctggtctca 

attataggca 

gacagagtct 

ctccatctcc 

ggtgcccgcc 

tgttggacag 

gtgctgggat 

ttaaggcctt 

atttgcatct 

gtatagcttt 

tattatggga 

tcaacagtga 

ttttcttttg 

ctttttggaa 

agttttttga 

tgagtcagtt 

actgcaactt 

ggactacagg 

ttcaccatgt 

cctcacaaag 

atagtttgca 

ccatacaatt 

atagctccac 

ttttttttga 

gctcactgca 

gggattacag 

acggagtctg 

ctccgactcc 

ggcgtgcacc 

gttggccagg. 

agtgctggga 

atggggtttc 

gccttggcct 

tcttcttggt 

tctattgttt 

gctggtttgg 

tattgatttg 

gttgagatgg 

ctgcagcctc 

gccaaggtgt 

ctttcactgc 

gacacagtct 

tgagctcctg 

tgtacaccac 

gctcacacct 

gagttcgaga 



cccatctcta 

gctacttgag 

gccaagatca 

aaaaaaaaaa 

aacttttttg 

ttacttgtag 

agtgtgttga 

tgttctgtca 

cctgggctca 

ccaccatgcc 

aggctggtct 

ttacaggggt 

ctttagatga 

cgtatgttga 

tctttattat 

aactcctggg 

tgagccactt 

cactctgtcg 

cgggttcaag 

actatgccca 

gctggtcttg 

tacaggcatg 

atcatgtaat 

gtattgataa 

gatgccagca 

agaggtaaaa 

agtgatatat 

ttggaagttt 

gctaaaatgt 

ctactgattc 

tggtctcgct 

ctacctcccg 

cgcacgccag 

tggccaggct 

tgctgggact 

tgcatgtttc 

gttcatagta 

tttctctctt 

gatggagtct 

acccctgcct 

gtgtgagcca 

tttctgtctc 

cagattccag 

accacgcctg 

ctggtcttgg 

ttataggcat 

tccctgtcgg 

cccaaagtgc 

cattttcgct 

tcccattctg 

gtttaatttg 

agatctcttt 

agtctccctc 

cgccgcccag 

ttctttttga 

attccataag 

cactctgttg 

gactcaatca 

tgcacctaac 

gtaatcccag 

ccagcctggc 
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cgacagggag 
tgcctgtaat 
agaggttgca 
ctccatctca 
caggctggtc 
gagattatgg 
cattttcatt 
ttattcaaga 
tttttttttt 
cttgcctcac 
agtagctggg 
acgtgct.ggt 
ccaaagcact 
atagagagaa 
tccctcctcg 
tttcaagact 
cacttcggga 
acatggtgaa 
gcctgtagtc 
gagctcgcag 
ccgtctcaaa 
tgagaaattg 
ttgctgtttt 
gtggatccct 
atgtctttca 
tttgtttctt 
tggtgtagca 
tcctcatttt 
ttctcaaatc 
gctccaaagt 
tttggttaga 
ttaaatttct 
gtcacccagg 
ttcaagtgat 
cacccaccta 
gtcttaaact 
cagatgtgag 
. gtttttgtct 
tatttttaaa 
ctaggctgga 
gcgattctcc 
ggctaatttt 
gatctcgtga 
gagccaccgc 
gctgtagtgc 
gtctcccacc 
ggttgtgttg 
ctcctgggct 
agccaccgca 
ttctttgcat 
ttactttgaa 
ttgtcgtttc 
tcaggggtgc 
tctttaaata 
atgctaggac 
gcctggccag 
tggcacctgc 
agggggcgga 
gcgagactct 
ccaggttgcc 
cctcagcctt 



aaaccccatc 
cccagctact 
gtgagccagg 
agaaaagaaa 
tcaaactcct 
gcaagagcca 
gatctcaaga 
gtgtctttat 
gagaaagggt 
tgcaacctct 
attacaggca 
cagactggtc 
gggattacag 
ggtctcactc 
gcctcccaag 
ttaaaaatgc 
ggccgaggtg 
accctgtctc 
ccagctactc 
tgagctgaga 
aaaaaaaaaa 
gctgttaatc 
aggagattct 
caatttataa 
ttaaatttgc 
cctgtctgtt 
caggtccctt 
ggataaattc 
tgctgttgaa 
ttctatttgg 
catggttctt 
ttttatttag 
ctggcaggca 
tctcctgcct 
atttttggta 
actaacctca 
ccactgtgcc 
agtcattcaa 
aaattttttc 
gtgcaatggc 
tgcttcagcc 
ttgtattttc 
cctcatgatc 
gcccagccta 
agtggcacaa 
ttagtctcct 
ttactgtttt 
caagcagtca 
cccggcctgt 
gccttgtaat 
aatcagatcc 
tcgtttgttt 
ttgtttctgt 
tctagtggca 
tttgggaagt 
tatggtgaaa 
ctgtagtccc 
ggttgcagtg 
gtctcaaata 
tctgtactgg 
tacttctcgc 



tctactaaaa 
caggaggctg 
atcacaccat 
aaaaaaagtt 
gacttgaagg 
ccgcaccccg 
catcctaatc 
ttctgcatat 
cttgctctgt 
gcctcccggg 
cacaccacca 
tcgaattcct 
gcgtgaaaca 
aagcccaggc 
gtgctgagat 
catcttggct 
ggcagatcac 
tactaaaaat 
gggaagctga 
tcacaccact 
aaaatgtcat 
ttattgagga 
ctatctttgg 
gctacttgga 
aaagtttcag 
tgccgcttat 
gggctctagg 
agctgacctg 
acttctggtg 
tttctttctg 
ttgttttcct 
ttattaattt 
acgtcacaat 
cagcctccca 
tttttagtag 
ggtgatctgt 
cagcctcttt 
atgtctaggc 
taattttctt 
ttgatcttgg 
tcctgagtag 
agtagagaca 
ctcctgcctc 
ttttttattt 
ccctggcaca 
gag t age tag 
agacacaggg 
tcccaccttg 
taatttcttt 
tttttgttga 
tccgccctct 
agttactttc 
tcttttaggt 
aaaaggataa 
ggaggcgggt 
ccctgtctc.t 
agctactggg 
agctgagatt 
aaaaaaaaaa 
ggcaagtcct 
tcccatggag 



atacaaaaat 
aggcaggaga 
tgcactccag 
tttgcagaga 
agtcctactg 
ccacttggct 
tcccttttgt 
ttgtaaattt 
cgcccaggct 
ttcaagtgat 
cacctggcta 
gacctcaggt 
ccatgcccag 
tggtcttgaa 
tacaggtgtg 
gggcacggtg 
gaggtcagga 
acaaaaaatt 
agcaggagaa 
gtactccagc 
ctcactgcct 
acatttatat 
gtttcagcag 
gttcattgga 
ctactattct 
ggaactttat 
catttttctt 
tcctcaagtt 
aaattttcac 
tagtaattat 
ttagttcatt 
tttttttttt 
cttggctcac 
agtagctggg 
aaactgggtt 
ccgcctcagc 
ttttagtgta 
ttcctcagga 
ttttttttag 
ctcactgcaa 
ctgggactat 
tggttttgcc 
ggcctcccaa 
tttgagacaa 
ctgcagcctt 
aactacaagc 
tcttgctaca 
gcctcccaag 
atttccggtg 
aacctgcaca 
gcagggttca 
ctgacctttt 
tagtggttag 
agaggccggg 
ggatcacttg 
actaaaaata 
aagactgagg 
gcgccattgc 
aaaaaggata 
tcagtgtccg 
cctaaggatg 



tagctgggcg 
atcgcttgaa 
cctgggtaac 
cag.ggtatca 
cctcagcctc 
gttttgttct 
ttttttgttc 
tccaaaaaag 
ggagaatggt 
cctcccacct 
atttttgtat 
gatctgcccg 
cccccaattt 
ctcctgagct. 
agtcacagta 
gctcacgcct 
gatcaagacc 
aaccaggtgt 
tggcg.tgaac 
ctgggcaaca 
tctggtccaa 
attgactagt 
tttgattata 
cttcttggat 
ttgcatcttg 
gcatacattg 
tgttcttttt 
cactgtttct 
tacagttact 
cactttacta 
atccatggtt 
tgaagcgggg 
tacaacctcc 
attataggca 
tcaccacatt 
ctcccaaaat 
tttaaggtaa 
acagtttcta 
atggagtctc 
cctctgcctc 
aggtgcgtgc 
gtgttagcca 
agtgctggaa 
agtctccctc 
aaccgtccag 
atgtgccacc 
tttctctgac 
gtgttgagat 
aatgggccac 
atttgaagat 
ttgttgctgt 
taaataaaga 
cttgtgcttt 
cgcagtggct 
aggtcaggag 
caaaaattaa 
caggagaatc 
actccagcct 
aagagtgtct 
ccaggctgtt 
aaccagaggt 



tggtggcagg 
cctgggaggc 
aaaagcaaaa 
ctttgttgcc 
ccaaagtgct 
gttgtatttc 
gacttactgg 
tttttctttc 
ggtgcacaat 
cagccttccc 
tttagtctta 
ccttggcctc 
ttttttttta 
caagctgtca 
cctggccttc 
gtaatcccag 
"accctggcta 
ggtggcaggt 
ccgggaggtg 
gtgcgagact 
tagtttctga 
cacttgtctc 
atgtatcagt 
gtgtaaattc 
aaatactagt 
atgtgcttca 
ttctttctgc 
ttcttcttcc 
gtacttttta 
gtattctcta 
tcctttattt 
tttcactctt 
gcctcctggg 
tgtgccacca 
ggccagactg 
gctgggatta 
ttgattgaaa 
ttaatttctt 
actctatagc 
ctgggttcaa 
caccactcct 
ggatggtctc 
ttacaggtgt 
tctcacccag 
gcttaagtga 
atgcctggct 
tggtcttgaa 
tacaggtgtg 
actttcttgt 
gataatgtgg 
ttgttgtgga 
ctatattctg 
gaaagagatt 
cacgcctgta 
tttaagatca 
ccgggcatgg 
gcttcaatcc 
gggcaacaga 
tccatccttt 
cacggctttt 
gaaag.ttgag 
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ggcctcctca 
tttccaagca 
gcctcttcct 
acaacttcag 
gttatcccag 
ccagcctggc 
tggtgggtgc 
gggaggcgga 
gtgagactcc 
aggctgcccc 
acactagaaa 
tctgacacca 
tggtagtagt 
cttcattgag 
ttttgacagt 
gtttactgtg 
gctgccaact 
ttttaagtta 
caatgtactt 
agtttcactc 
cgcctcccag 
acccccacca 
tcaggctggt 
tgggattaca 
gatctacttc 
acctcccttt 
ctctggcata 
gacacagctc 
tagctccatc 
tccatcctcc 
tcctcccttc 
cccttctcct 
tctcctctga 
ctctgacata 
gacatagctc 
tagctccatc 
ttgccatata 
tcaaaagtat 
ttttgggggc 
catgacgaaa 
ctgtagtccc 
ggaagtcaag 
ttatctcaga 
gaggctgagg 
tgaaaccccg 
atcccagcta 
cagtgagcaa 
aaaaaaaaaa 
aatcccaaag 
agcctgggca 
aaaaagtata 
ttctttcctg 
atttagaatt 
aatggtgcag 
tcagcctccc 
gtatttttag 
tcatgatctg 
gcccatttag 
aggctggagt 
cattctcctg 
ctaatttttt 



ggtgtttctg 
tgaacaggag 
tcctaggctt 
gtag.tatcta 
cactttggga 
caacatggtg 
ctgtaatctc 
ggttgcagtg 
atctcgggga 
agccctgaga 
gatccaaatg 
gcaagccaca 
ggctgagcag 
cagtcatctg 
ttaatagaaa 
acagttttga 
ccatcatttt 
Ctttcttagt 
cagattatta 
tgtcacccag 
gttcacgcca 
cacccggcta 
ctcgaactgc 
ggtgtgagcg 
cgttgtacta 
tcctccgaca 
gctccaccct 
catcctccct 
ctccctcctc 
cttctcctct 
tcctatgaca 
atgacacagc 
catagctcca 
gctccatcct 
catcctccct 
ctcccttctc 
tacatttatg 
tttgaaagaa 
tgaggtgggc 
cccatctcca 
agctatttgg 
gctgcagtga 
agtaaaggga 
tgagtggatc 
tctctactaa 
ctcgggaggc 
gatcgcacca 
aaaaagaaaa 
ccttgggagg 
acatagcgag 
cttgtttttt 
tgaatttgag 
tttttttttt 
tctcagatca 
aagtagctga 
tagagacagg 
cctgtcctgg 
aatttctttt 
gcagtggcac 
cctcagcctc 
gtatttttag 



agcccctgtc 
ctttccaaag 
ttctgtcagc 
cctgtaaatg 
ggccgaggcg 
aagccccgtc 
agctactcgg 
agctgagatc 
aaaaaaaaaa 
gagttcgagg 
cataagcagg 
tcagaaatac 
aaatagccca 
gttcgtagac 
aaagtttatt 
cagcagaata 
ggtgatatcc 
ggttgctcta 
ctaacttaac 
gctggagtgc 
ttctcctgcc 
attttgtatt 
tgacctcagg 
actgcactga 
tgacatagtt 
tagttccatc 
cccttctcct 
tctcctatga 
ctatgtcata 
ggcatagctc 
cagctccatc 
tccatcctcc 
tcctcccttc 
cccttctcct 
tctcctctga 
ctctgacata 
tatgttataa 
gttgcctgga 
agatcgcctg 
ccaaaattac 
gggaggatcc 
gctgagattg 
ctaggaatgg 
accggaggtc 
aaatacaaaa 
tgaggcaaga 
ctgcattaca 
gaaaagaaat 
ccaaggcagg 
actccatctc 
tgtatcatcc 
ttactgtcta 
tttgagacaa 
ctgcaacctc 
gactgcaggc 
gtttcactat 
cctcccaaaa 
tttttttttt 
gatctcggct 
ccgagtagct 
gagagatggg 



tagccccagc 
cccttagacc 
tctttgccca 
ccttcaggcc 
ggtgaattgc 
tctagtaaaa 
gaggctgaag 
gtgccattgc 
aaaaaaatgc 
gggtgaaaca 
attccttgag 
aggttgcctt 
gctgtcctcc 
accagagtta 
gtgacagttt 
gttgctttgc 
agctctgttg 
gagatgacaa 
acttaaagta 
aatggtgtga 
tcagcctcct 
tttagtagag 
tgatccgccc 
gcctaagtat 
ccccctccat 
ctccctcctc 
atgacacagc 
cacagctcca 
gctccatcct 
catcctccct 
ctcccttctc 
cttctcctct 
tcctctgaca 
ctgacatagc 
catagttcca 
gctccatccc 
cttcagctct 
ggcagtggct 
agccagggag 
.aaaaaattag 
cagctaaggt 
tgccactgca 
tggcttttat 
aggagtttaa 
agtagccggg 
gaatcgcttg 
gcctagatga 
taagatctag 
aggatcactt 
tatttaaaaa 
atattttacc 
gtgtcatttc 
agtctcactg 
tgcctccctg 
acctgccacc 
gttggccagg 
tgctgggatt 
ttttgagatg 
cactgcgagc 
gggattacag 
gtttcaccat 



tgtgtgcatg 
ttcatgtagc 
tctgttgttg 
aggcgcggtg 
ttgaggtcag 
atacaaaaat 
caggagaatt 
actccagcct 
catcaacagc 
aaggcaagcc 
aaaaggtctg 
ccccatggct 
tgaaatttag 
cagaaaagtt 
tgacagctga 
tggagagacg 
ctgaattttt 
tgtgcatctt 
cagcattttt 
tctcggctca 
gagtagctgg 
atgaggtttc 
atcttggcct 
ggcaacgtgt 
tttcctatag 
ctatgacgtc 
tccatcctcc 
tcctcccttc 
cccttctcct 
tctcctatga 
ctatgacaca 
ggcatagctc 
tagctccatc 
tccatcctcc 
tcctcccttg 
ctcttctcct 
tcagcgttat 
tatgccttta 
ttggagacca 
tctggcatgg 
gggaggatca 
ctccagcctg 
ctctaatccc 
gaccagcctg 
tgtggtggtg 
aacctgggaa 
cagagcgaga 
acactgtggt 
gaggccagga 
agaaagaaat 
tttctttttt 
cttttagtct 
tgttgcccag 
gttagagtga 
acccccagcc 
ctggtctcga 
accatgagcc 
gggtctcgct 
tccgcctccc 
gcgcctgcca 
gttagccagg 



gccttctgga 
tcttttccca 
tccctccccc 
gctcatacct 
gagttcgaga 
tagctgggcg 
gcttgagcct 
gggcgacaga 
acgaccctgg 
cttcagggag 
tatcatccct 
acatgtgagc 
cagggtctta 
tattgggagg 
atagaaaaaa 
gatctttgga 
agctatgctg 
taacttacca 
ttttttatgg 
ctgcaacctc 
gactacaggc 
accatgttgg 
cccaaagtgc 
ctataacata 
cacagtccca 
ctcccttctc 
cttctcctct 
tcctctgaca 
ctgacacagc 
cacagctcca 
gctccatcct 
catcctccct 
ctcccttctc 
cttctcctct 
tcctctgaca 
tcatgtatta 
aattattgct 
actccagcac 
gcctgggcaa 
tggcacgcgc 
cttgagcctg 
ggtgcagatc 
agcactttgg 
gccaacatgg 
ggtgtctgta ' 
gcggaggttg. 
ctctgcctaa 
tcatgcctgt 
gttcaacacc 
tcaaagagaa 
ttgccccttt 
gaagaacttc 
gctggagtgc 
ttttcctgcc 
aatttttttg 
attcatgacc 
accacgccca 
cttgtttccc 
gggttcacgc 
ccacgcccac 
atggtcttga 



110640 

110700 

110760 

110820 

110880 

110940 

111000 

111060 

111120 

111180 

111240 

111300 

111360 

111420 

111480 

111540 

111600 

111660 

111720 

111780 

111840 

111900 

111960 

112020 

112080 

112140 

112200 

112260 

112320 

112380 

112440 

112500 

112560 

112620 

112630 

112740 

112800 

112860 

112920 

112980 

113040 

113100 

113160 

113220 

113280 ' 

113340 

113400 

113460 

113520 

113580 

113640 

113700 

113760 

113820 

113380 

113940 

114000 

114060 

114120 

114130 

114240 
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tctcctgacc tcgtgatccg cccgccttgg cctcccaaag tgctgggatt acaggcgtga 114300 

gccaccgcgc ccggctagaa tttcttgtag gacaggctcg ctagcaacca attcagtgtt 114360 

tatttgggaa tgtctttatt tcagcttcac tttttgaagg atagtttagc tggctataga 114420 

attattaatt gatcattctt ttcagtgttt aaaagtgtca tcatgctacc ttctgggttc 114480 

cattgtttct gatgagaagt catctgtcaa attgtccctt tgtacttgaa gaattatctt 114540 

cttttctctt gatgttttca agattttctc Cttgtctttg gcctttag.ta gtttgtgatg 114600 

catctaggtg tggatctctt ggcgtgcatc gtatttgggc ttcagtaagc ctcttagatt 114660 

catagattaa tgttttgttt tgttttacca aatttggaga gtttttactc atcatttcaa 114720 

caaatttttt tcctgcccct ctctcatctc cttttgggag caccaccgca. tgtatgttgg 114780 

tgtgcgttct eta 1147.93 



<210> SEQ ID N0:4 
<211> LENGTH: 25 
<212> TYPE: DNA 

<213> ORGANISM :Artif Icial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO:4 
cacaggttca gcatgtttgt gegtc. 

<210> SEQ ID NO: 5 
<211> LENGTH: 25 
<212> TYPE: DNA 

<213> ORGANISM:Artificial Sequence 



<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 5 
cacagtccct gctggcctct gtcta 

<210> SEQ ID NO: 6 
<211> LENGTH: 25 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<22 0> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 6 
caggacatct ccatcaagag getge 

<210> SEQ ID NO: 7 

<211> LENGTH: 25 

<212> TYPE: DNA 

<213> ORGANISMrArtificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO:7 
aataagaggg ggecaggate agtgc 

<210> SEQ ID NO: 8 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM : Artificial Sequence 
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<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 8 
gtgaatggca tcctggagag 

<210> SEQ' ID NO: 9 
<211> LENGTH :. 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 9 
gtctccaggc agctcaacag 

<210> SEQ ID NO: 10 

<211> LENGTH: 20 

<212> TYPE: DNA 

<213> ORGANISM:Arti.ficial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION : oligonucleotide 

<400> SEQ ID NO: 10 
accctgtccc tcctgtctga 

<210> SEQ ID NO: 11' 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM:Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 11 
agaccctaag atgttcggag 

<210> SEQ ID NO: 12 
<211> LENGTH: 20 
<212> TYPE: DNA . 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 12 
gatgacctgt gtgagttgcg 

<210> SEQ ID NO:13 

<211> LENGTH: 20 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 13 
cgcaactcac. acagg.tcatc. 
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<210> SEQ ID NO: 14 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<4 00> SEQ ID NO: 14 
ggagtcaggt caaaggatgc 

<210> SEQ ID NO: 15 

<211> LENGTH: 20 

<212> TYPE: DMA 

<213> ORGANISM : Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide , 

<400> SEQ ID NO: 15 
gcatcctttg acctgactcc 

<210> SEQ ID NO: 16 
<211> LENGTH: 20 
<212>.TYPE: DNA 

<213> ORGANISM : Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<40O> SEQ ID NO: 16 
ggtctgaaac gtgatctggg 

<210> SEQ ID NO:17 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM:Artificial Sequence 

.'<220> FEATURE: 
<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 17 
cccagatcac gtttcagacc 

<210> SEQ ID NO: 13 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 18 
cgatgatgtg. tgggttctcc 

<210> SEQ ID NO: 19 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence . 
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<22 0> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO:19 
ggagaaccca cacatcatcg 

<210> SEQ ID NO:20 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM:Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION : oligonucleotide 

<400> SEQ ID NO:20 
cgtgtctgag aagtccagcc 

<210> SEQ ID NO: 21 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO:21 
ggctggactt ctcagacacg 

<210> SEQ ID NO:22' 
<211>. LENGTH: 20 
<212> TYPE: DNA 

<213> ORGAN ISM:Artificial Sequence 
<22 0> FEATURE: 

<223> OTHER INFORMATION : oligonucleotide 

<400> SEQ ID NO:22 
acagcatctt ctccacgcac 

<210> SEQ ID NO:23 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM : Artificial Sequence . 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 23 
agtcctctgg ctttgcagtg 

<210> SEQ ID NO:24 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213>' ORGANISM: Artificial Sequence' 
<220> FEATURE: 

<223> OTHER INFORMATION : oligonucleotide 

<400> SEQ ID NO: 24 
, tgtgcgtgga gaagatgctg 
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<210> SEQ ID NO: 25 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM:Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 25 
ggctggaaag ggaagcctac 

<210> SEQ ID NO: 26 
<211> LENGTH: 20 
<212> TYPE : DNA 

<213> ORGANISM : Artificial, Sequence ^ 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 26 
tggttcaggt gctcttgggg 

<210> SEQ ID NO: 27 
<211> LENGTH: 20 • 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 27 
cgtgaagcag gagttgagcc 

<210> SEQ ID NO:28 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM : Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 28 
atcttgctct gggtcttccc 

<210> SEQ ID NO: 29 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM:Artif,icial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO: 29 
cactgcaaag ccagaggact 

<210> SEQ ID NO: 30 
<211> LENGTH: 20 
<212> TYPE:- DNA 

<213> ORGANISM: Art if icial Sequence 
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<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO:30 
ataagcaaga cgacgacctc 

<210> SEQ ID NO: 31 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO:31 
ctattctgtt gggtgggttc 

<210> SEQ ID NO: 32 
<211> LENGTH: 20 
<21-2> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO:32 
cgtgcctcct gtgcttaccc 

<210> SEQ ID NO:33 ' 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM:Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO:33 
cagaccccaa ggtagctcag 

<210> SEQ ID NO:34 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM : Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: oligonucleotide 

<400> SEQ ID NO:34 
ggaagaccca gagcaagatc 

<210> SEQ ID NO:35 
<211> LENGTH: 780. 
<212> TYPE: PRT 

<213> ORGANISM:Dictyostelium discoideum 
<400> SEQ ID NO:35 

Met Lys Phe Tyr lie Glu Asp Leu Leu Val Tyr Phe Pro Tyr Ser Tyr 

1 5 10 15 

He Tyr Pro Glu Gin Tyr- Ser Tyr Met Val Ala Leu Lys Arg Ser Leu 
20 ' 2S 30 



46 



WO 01/42434 



PCT/USOO/33065 



Asp Asn Gly Gly Pro Cys lie Leu Glu Met Pro Ser Gly Thr Gly Lys 

35 40 45 

Thr val Ser Leu Leu Ser Leu lie Ser Ser Tyr Gin Val Lys Asn Pro 

50 55 60 

Ser He Lys Leu He Tyr Cys Ser Arg Thr Val Pro Glu He Glu Gin 
65 70 75 80 

Ala Thr Glu Glu Ala Arg Arg Val Leu Gin Tyr Arg Asn Ser Glu Met 

85 90 95 

Gly Glu Glu Ser Pro Lys Thr Leu Cys Met Ser Met Ser Ser Arg Arg 

100 105 HO 

Asn Leu Cys He Gin Pro Arg Val Ser Glu Glu Arg Asp Gly Lys Val 

115 120 125 

Val Asp Ala Leu Cys Arg Glu Leu Thr Ser Ser Trp Asn Arg Glu Ser 

130 135 140 

Pro Thr Ser Glu Lys Cys Lys Phe Phe Glu Asn Phe Glu Ser Asn Gly 
145 150 155 160 

Lys Glu He Leu Leu Glu Gly Val Tyr Ser Leu Glu Asp Leu Lys Glu 

165 170 175 

Tyr Gly Leu Lys His Gin Met Cys Pro Tyr Phe Leu Ser Arg His Met 

130 185 190 

Leu Asn Phe Ala Asn He Val He Phe Ser Tyr Gin Tyr Leu Leu Asp 

195 200 205 

Pro Lys He Ala Ser Leu lie Ser Ser Ser Phe Pro Ser Asn Ser He 

210 215 220 

Val Val Phe Asp Glu Ala His Asn He Asp Asn Val Cys He Asn Ala 
225 230 235 240 

Leu Ser lie Asn lie Asp Asn Lys Leu Leu Asp Thr Ser Ser Lys Asn 

245 250 255 

He Ala Lys He Asn Lys Gin He Glu Asp He Lys Lys Val Asp Glu 

260 265 270 

Lys Arg Leu Lys Asp Glu Tyr Gin Arg Leu Val Asn Gly Leu Ala Arg 

275 280 285 

Ser Gly Ser Thr Arg Ala Asp Glu Thr Thr Ser Asp Pro Val Leu Pro 

290 295' 300 

Asn Asp Val lie Gin Glu Ala Val Pro Gly Asn He Arg Lys Pro Ser 
305 310 315 320 

He Phe lie Ser Leu Leu Arg Arg Val Val Asp Tyr Leu Arg Glu Pro 

325 330 335 

Asp Lys Ser Arg Leu Lys Ser Gin Met Leu Leu Ser Glu Ser Pro Leu 
340 345 350 

. Ala Phe Leu Gin Gly Leu Tyr His Ala Thr Gin He Ser Ser Arg Thr 
355 360 365 

Leu Arg Phe Cys Ser Ser Arg Leu Ser Ser Leu Leu Arg Thr Leu Arg 

370 375 380 

He Asn Asp Val Asn Gin Phe Ser Gly lie Ser Leu He Ala Asp Phe 
385 390 395 ■ 400 

Ala Thr Leu Val Gly Thr Tyr Asn Asn Gly Phe Leu He He He. Glu 

405' 410 415 

Pro Tyr Tyr Gin Arg Gin Asn Asn Thr Tyr Asp Gin He Phe Gin Phe 

420 425 430 

Cys Cys Leu Asp Ala Ser He Gly Met Lys Pro lie Phe Asp Lys Tyr 

435 440 445 

Arg Ser Val Val lie Thr Ser Gly Thr Leu Ser Pro Leu Asp He Tyr 

450 455 . 460 

Thr Lys Met Leu Asn Phe Arg Pro Thr Val Val Glu Arg Leu. Thr Met 
465 470 475 . 480 

Ser Leu Asn Arg Asn Cys lie Cys Pro Cys lie Leu Thr Arg Gly Ser 

485 490 495 

Asp Gin lie Ser lie Ser Thr Lys Phe Asp Val Arg Ser Asp Thr Ala 
500 505 510 



47 



WO 01/42434 



PCT/US00/33065 



vai 


vai 


Arg 


Asn 


Tyr 


oiy 


A 1 a 
nla 


Leu 


Leu 


Val 


Glu 


Val 


Ser 


Ala 


He 


Val 






m ^ 

jIj 




















525 








Drn 
rlo 


Asp 


biy 


lie 


Tl p 
lie 


uys 


true 


Phe 


Thr 


Ser 


Tyr 


Ser 


Tvr* 
xy i 


Met 


Glu 


Gin 




DjU 










c.m 

J J J. 










540 










lie 


val 


Ser 


Val 


Trp 


Asn 


n n 

U-L U 


Mot" 
del. 


CZ ] v 
oiy 


Leu 


Leu 


Asn 


Asn 


He 


Leu 


Thr 


C >1 c 










^ n 










55S 










J O ly 


Asn 


Lys 


Leu 


lie 


rfie 


vai 


CjlU 


mr 


Co-r* 
icl 


Asp 


Pro 


Ala 


n n 

\s± Li 


c or 
Oci 


.ft -La 


Leu 










c c c 










c. 1 










J / j 




Ala 


Leu 


Gin 


Asn 


Tyr 


Lys 


Lys 


Aia 


Cys 


Asp 


Ser Gly 


Arg. 


oiy 


nla 


vol 








con 










Do j 










c. Q C\ 






Leu 


Leu 


Ser 


Val 


Ala 


Arg 


.Gly 


Lys 


vai 


oer 


Glu 


Gly 


Tie 

lie 


Asp 


rue 


Asp 






595 










600 










bUD 








Asn 


Gin 


Tyr 


Gly 


Arg 


Cys 


Vai 


He 


Leu 


Tyr 


Gly 


lie 


Pro 


Tyr 


lie 


Asn 




610 










615 










620 










Thr 


Glu 


Ser 


Lys 


Val 


Leu 


Arg 


Ala 


Arg 


Leu 


Glu 


Phe 


Leu 


Arg 


Asp- 


Arg 


625 










630 










635 










640 


Tyr 


Gin. 


lie 


Arg 


Glu 


Asn 


Glu 


Phe 


Leu 


Thr 


Phe 


Asp 


Ala 


Met 


Arg 


i nr 










645 










c c r\ 

650 










c c c 




Ala 


Ser 


Gin 


Cys 


Val 


Gly 


Arg 


Val 


lie 


Arg 


Gly Lys 


Com- 
ber 


Asp 


Tyr 


bly 








660 










665 










c ~i n 
b / U 






lie 


Met 


lie 


Phe 


Ala 


Asp 


Lys 


Arg 


Tyr 


Asn 


Arg 


Leu 


Asp 


Lys 


Arg 


Asn 






675 










6.80 










for 
OoD 








Lys 


Leu 


Pro 


Gin 


Trp 


He 


Leu 


Gin 


Phe 


Cys 


Gin 


Pro 


bin 


nlS 


Leu 


Asn 




690 










695 










700 










Leu 


Ser 


Thr 


Asp 


Met 


Ala 


He 


Ser 


Leu 


Ser 


Lys 


Thr 


Phe 


Leu 


Arg 


Glu 


705 










710 










715 










"7 O fl 


Met 


Gly 


Gin 


Pro 


Phe 


Ser 


Arg 


Glu 


CjlU 


bin 


Leu Gly 


Lys 


JCi 


Leu 


Trp 










725 










/ J u 










/ jj 




Ser 


Leu 


Glu 


His 


Val 


Glu 


Lys 


Gin 


Ser 


Thr 


Ser 


Lys 


Pro 


Pro 


Gin 


Gin 








740 










745 










750 






Gin 


Asn 


Ser 


Ala 


He 


Asn 


Ser 


Thr 


He 


Thr 


Thr 


Ser 


l nr 


Thr 


Thr 


Thr 






755 










760 










/ fa J 








Thr 


Thr 


Thr 


Ser 


Thr 


lie 


Ser 


Glu 


Thr 


His 


Leu 


Thr 












770 










775 










780 










<210> SEQ ID NO 


:36 
























<211> LENGTH: 778 
























<212> TYPE: 
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<213> ORGANISM: S. cerevisiae 
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He 
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He 


Ser 
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Glu 


Glu 


Gly 


Phe 
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Leu 
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He 


He 


Glu 


Pro 
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Glu 


He 


Glu 
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Val 


Pro 


Asn 








420 
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He 
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He 


Ala 


He 
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Pro 
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Arg 


Met 


Leu 


Asn 


Phe 


Lys 


Thr 


Val 


Leu 


Gin 


465 
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lie Pro Phe Gin Tyr Thr Glu Ser Arg lie Leu Lys Ala Arg Leu Glu 
625 630 635 640 

Phe Met Arg Glu Asn Tyr Arg lie Arg Glu Asn Asp Phe Leu Ser Phe 

645 650 655 

Asp Ala Met Arg, His Ala Ala Gin Cys Leu Gly Arg Val Leu Arg Gly 

. 660 665 670 

Lys Asp Asp Tyr Gly Val Met Val Leu Ala Asp Arg Arg Phe Ser Arg 

675 680 ' 685 

Lys Arg Ser Gin Leu Pro Lys Trp lie Ala Gin Gly Leu Ser Asp Ala 

690 " 695 700 

Asp Leu Asn Leu Ser Thr Asp Met Ala lie Ser Asn Thr Lys Gin Phe 
705 710 715 . 720 

Leu Arg Thr Met Ala Gin Pro Thr Asp Pro Lys Asp Gin Glu Gly Val 

725 . 730 735 

Ser Val Trp Ser Tyr Glu Asp Leu lie Lys His Gin Asn Ser. Arg Lys 

740 745 750 

Asp Gin Gly Gly Phe lie Glu Asn Glu Asn Lys Glu Gly Glu Gin Asp 

755- 760 . 765 

Glu Asp Glu Asp Glu Asp lie Glu Met Gin 
• 770 775 

<210> SEQ ID MO: 37 
<211> LENGTH: 772 
<212> TYPE: PRT 
<213> ORGANISM: S . pombe 

<400> SEQ ID NO: 37 

Met Lys Phe* Tyr lie Asp Asp Leu Pro lie Leu Phe Pro Tyr Pro Arg 

1 5 - 10 15 

He Tyr Pro Glu Gin Tyr Gin Tyr Met Cys Asp Leu Lys His Ser Leu 

20 25 30 

Asp Ala Gly Gly He Ala Leu Leu Glu Met Pro Ser Gly Thr Gly Lys 

35 40 45 

Thr He Ser Leu Leu Ser Leu He Val Ser Tyr Gin Gin His Tyr Pro 

.50 55 60 

Glu His Arg Lys Leu He Tyr Cys Ser Arg Thr Met Ser Glu He Asp 
65 70 75 80 

Lys Ala Leu Ala Glu Leu Lys Arg Leu Met Ala Tyr Arg Thr Ser Gin 

85 90 95 

Leu Gly Tyr Glu Glu Pro Phe Leu Gly Leu Gly Leu Thr Ser Arg Lys 

100 105 110 

Asn Leu Cys Leu His Pro Ser Val Arg Arg Glu Lys Asn Gly Asn Val 

115 120 125 

Val Asp Ala Arg Cys Arg Ser Leu Thr Ala Gly Phe Val Arg Glu Gin 

130 135 140 

Arg Leu Ala Gly Met Asp Val Pro Thr Cys Glu Phe His Asp Asn Leu 
145 150 155 160 

Glu Asp Leu Glu Pro His Ser Leu lie Ser Asn Gly Val Trp Thr Leu 

165 170 175 

Asp Asp He Thr Glu Tyr Gly Glu Lys Thr Thr Arg Cys Pro Tyr Phe 

180 185 190 

Thr Val Arg Arg Met Leu Pro Phe Cys Asn Val He lie Tyr Ser Tyr 

195 200 205 

His Tyr Leu Leu Asp Pro Lys He .Ala Glu Arg val Ser Arg Glu Leu 

210 215 220 

Ser Lys Asp Cys lie Val Val Phe Asp Glu Ala His Asn lie Asp Asn 
225 . 230 235 240 

Val Cys lie Glu Ser Leu Ser He Asp Leu Thr Glu Ser Ser Leu Arg 
245 250 255 
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Lys Ala. Ser Lys Ser He Leu Ser Leu Glu Gin Lys Val Asn Glu Val 

260 265 270 

Lys Gin Ser Asp Ser Lys Lys Leu Gin Asp Glu Tyr Gin Lys Leu Val 

275 280 ■ 285 

Arg Gly Leu Gin Asp Ala Asn Ala Ala Asn Asp Glu Asp Gin Phe Met 

290 295 300 

Ala Asn Pro Val Leu Pro Glu Asp Val Leu Lys Glu Ala Val Pro Gly 
305 310 315 320 

Asn He Arg Arg Ala Glu His Phe He Ala Phe Leu Lys Arg Phe Val 

325 330 335 

Glu Tyr Leu Lys Thr Arg Met Lys Val Leu His Val lie Ala Glu Thr 

340 345 350 

Pro Thr Ser Phe Leu Gin His Val Lys Asp lie Thr Phe He Asp Lys 

355 360 365 

Lys Pro Leu Arg Phe Cys Ala Glu Arg Leu Thr Ser Leu Val Arg Ala 

370 375 380 

Leu Gin He Ser Leu Val Glu Asp Phe His Ser Leu Gin Gin Val Val 
385 390 395 400 

Ala Phe Ala Thr Leu Val Ala Thr Tyr Glu Arg Gly Phe. He Leu He 

405 410 415 

Leu Glu Pro Phe Glu Thr Glu Asn Ala Thr Val Pro Asn Pro He Leu 

420 .425 430 

Arg Phe Ser Cys Leu Asp Ala Ser He Ala He Lys Pro Val Phe Glu 

435 440 445 

Arg Phe Arg Ser Val He He Thr Ser Gly Thr Leu Ser Pro Leu Asp 

450 455 460 

Met Tyr Pro Lys Met Leu Gin Phe Asn Thr Val Met Gin Glu Ser Tyr 
465 470 475 480 

Gly Met Ser Leu Ala Arg Asn Cys Phe Leu Pro Met Val Val Thr Arg 

485 490 495 

Gly Ser Asp Gin Val Ala He Ser Ser Lys Phe Glu Ala Arg Asn Asp 

500 505 510 

Pro Ser Val Val Arg Asn Tyr Gly Asn He Leu Val Glu Phe Ser Lys 

515 520 525 

He Thr Pro Asp Gly Leu Val Ala Phe Phe Pro Ser Tyr Leu Tyr Leu 

530 535 540 

Glu Ser He Val Ser Ser Trp Gin Ser Met Gly lie Leu Asp Glu Val 
545 550 555 560 

Trp Lys Tyr Lys Leu He Leu Val Glu Thr Pro Asp Pro His Glu Thr 

' 565 570 575 

Thr Leu Ala Leu Glu Thr Tyr Arg Ala Ala Cys Ser Asn Gly Arg Gly 

580 585 590 

Ala Val Leu Leu Ser Val Ala Arg Gly Lys Val Ser Glu Gly Val Asp 

595 60.0 605 

Phe Asp His .His Tyr Gly Arg Ala Val He Met Phe Gly He Pro Tyr 

610 615 620 

Gin Tyr Thr Glu Ser Arg Val Leu Lys Ala Arg Leu Glu Phe Leu Arg 
625 630 ' 635 640 

Asp Thr Tyr Gin He Arg Glu Ala Asp Phe Leu Thr Phe Asp Ala Met 

645 " 650 655 

Arg His Ala Ala Gin Cys Leu Gly Arg Val Leu Arg Gly Lys Asp Asp 

660 665 670 

His Gly He Met Val Leu Ala Asp Lys Arg Tyr Gly Arg Ser Asp Lys 

675 680 685 

Arg Thr Lys Leu Pro Lys Trp lie Gin Gin Tyr lie Thr Glu Gly Ala 

690 695 700 

Thr Asn Leu Ser Thr Asp Met Ser Leu Ala Leu Ala Lys Lys Phe Leu' 
705 710 • 715 720 

Arg Thr Met Ala Gin Pro Phe Thr Ala Ser Asp Gin Glu Gly lie Ser 
725 730 735 
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Trp Trp Ser Leu Asp Asp Leu Leu He His Gin Lys Lys Ala Leu Lys 

740 745 750 

Ser Ala. Ala He Glu Gin Ser Lys His Glu Asp Glu Met Asp He Asp 

755 760 765 

Val Val Glu- Thr 
770 

<210> SEQ ID NO: 38 

<211> LENGTH: 760 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapien 



<400> SEQ ID NO:38 

Met Lys Leu Asn Val Asp Gly Leu Leu Val Tyr Phe 

1 5 10 

He Tvr Pro Glu Gin Phe Ser Tyr Met Arg. Glu Leu 

20 25 
Asp Ala Lys Gly His Gly Val Leu Glu Met Pro Ser 

35 40 
Thr Val Ser Leu Leu Ala Leu He Met Ala Tyr Gin 

50 55 60 

Leu. Glu Val Thr Lys Leu He Tyr Cys Ser Arg Thr 
65 70 75 

Glu Lys Val He- Glu Glu Leu Arg Lys Leu Leu Asn 

85 90 
Gin Glu Gly Glu Lys Leu Pro Phe Leu Gly Leu Ala 

100 105 
Lys Asn Leu Cys He His Pro Glu Val Thr Pro Leu 

115 ' 120 

Asp Val Asp Gly Lys Cys His Ser Leu Thr Ala Ser 
130 135 140 

Gin Tyr Gin His Asp Thr Ser Leu Pro His Cys Arg 
145 150 155 

Phe Asp Ala His Gly Arg Glu Val Pro Leu Pro Ala 

165 170 
Leu Asp Asp Leu Lys Ala Leu Gly Arg Arg Gin Gly 

180 185 
Phe Leu Ala Arg Tyr Ser He Leu His Ala Asn Val 

195 200 
Tyr His Tyr Leu Leu Asp Pro Lys He Ala Asp Leu 
210 215 220 

. Leu Ala Arg' Lys Ala Val - Val Val Phe Asp Glu Ala 
225 230 235 

Asn Val Cys He Asp Ser Met Ser Val Asn Leu Thr 

245 250 
Asp Arg Cys Gin Gly Asn Leu Glu Thr Leu Gin Lys 

260 265 
He Lys Glu Thr Asp Glu Gin Arg Leu Arg. Asp Glu 

275 280 
Val Glu Gly Leu Arg Glu Ala Ser Ala Ala Arg Glu 
290 295 300 

Leu Ala Asn. Pro Val Leu Pro Asp Glu Val Leu Gin 
305 310 315 

Gly Ser lie Arg Thr Ala Glu His Phe Leu Gly Phe 

325 330 
Leu Glu Tyr Val Lys Trp Arg Leu Arg Val Gin His 

340 345 
Ser Pro Pro Ala- Phe Leu Ser Gly Leu Ala Gin Arg 

355 360 
Arg Lys Pro Leu Arg Phe Cys Ala Glu Arg Leu Arg 



Pro Tyr Asp Tyr 
15 

Lys Arg Thr Leu 
30 

Gly Thr Gly Lys 
45 

Arg Ala Tyr Pro 

Val Pro Glu He 
80 

Phe Tyr Glu Lys 
95 

Leu Ser Ser Arg 
110 

Arg Phe Gly Lys 
125 

Tyr Val Arg Ala 

Phe Tyr Glu Glu 
160 

Gly He Tyr Asn 
175 

Trp Cys Pro Tyr 
.190 

Val Val Tyr Ser 
205 

Val Ser Lys Glu 

His Asn He Asp 
240 

Arg Arg Thr Leu 
255 

Thr Val Leu Arg- 
270 

Tyr Arg Arg Leu 
285 

Thr Asp Ala "His 

Glu Ala Val Pro' 
320 

Leu Arg Arg Leu 
335 

Val Val Gin Glu 
350 

Val Cys He Gin 
365 

Ser Leu Leu His 
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